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CLASS II DNA METHYLTRANSFERASES OF ZEA MAYS 



FIELD OF THE INVENTION 
5 The present invention relates to nucleic acid and ammo acid sequences which 

encode class II DNA methyltransferases. The present invention further relates to 
methods of using the nucleic acid and ammo acid sequences described herein to 
stabilize transgene expression in transgenic plants, to alter the yield or biochemical 
qualities of plants and to silence targeted genes m plants in vivo. 

10 

BACKGROUND OF THE INVENTION 
The information content of a primary DNA sequence can be enhanced by the 
addition of a methyl group to the ring structure of cytosine or adenine residues 
(Finnegan. E.J., et aL Anna. Rev Plant PhysioL Plant Moi Biol. 49:223-47 (1998)). 

15 The chemical modification of DNA is known to affect protein-DNA interactions. 
Specifically, in prokaryotes, methylation of DNA prevents cleavage by the cognate 
restriction endonucieases. Id. In higher eukaryotes, cytosme methylation can inhibit 
binding of regulatory proteins and methylation of promoter and coding sequences of 
genes can repress transcription, both in vitro and in vivo. Id. Methylation of DNA 

20 has been implicated in the timing of DNA replication, in determination of chromatin 
structure, in increasing mutation frequency, as a causal agent for some human 
diseases, and as a basis for epigenetic phenomena. Id. 

Eukaryotic genomes are not methylated uniformly, but instead contain specific 
25 methylated regions, with other domains remaining unmethylated (Martienssen. R.A., 
et ah. Current Opinion in Genetics and Development. 5:234-242 (1995)). The 
enzymes that transfer methyl groups to the cytosine ring are cytosme-5- 
methyitransferases (hereinafter referred to as "DNA methyltransferases* , ) and have 
been characterized from a number of eukaroytes. All characterized eukaryotic DNA 
30 methyltransferases exhibit little primary sequence specificity in vitro other than the 
short canonical symmetrical sites methylated which are CpG in animals, and CpG and 
CpNpG in plants (where N stands for any nucleotide). Mammalian and plant 
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genomes contain methylation-free GC-rich zones, or CpG islands, which are 
frequently associated with the 5" regions of housekeeping genes. Id. 

in plants, DN A methylation is necessary for normal development. For 
5 example, Arabidopsis having reduced levels of DN A methylation demonstrate a range 
of abnormalities, including loss of apical dominance, reduced stature, altered leaf size 
and shape, reduced root length, homeotic transformation of floral organs and reduced 
fertility (Finnegan. E J., et ah, Anna. Rev. Plant Physiol Plant Mol Biol 49:223-47 
(1998)). Moreover. Arabidopsis plants in which methylation had been reduced by at 
10 least 70% became infertile after four to five generations of selfing. Id A comparable 
reduction m DNA methylation is embryo lethal in mammals. Id. 

Two classes of DNA methyltransferase enzymes have been cloned in plants 
(Finnegan. EJL et ah. Annu. Rev. Plant Physiol Plant Mol Biol 49:223-47 (1998)) - 

15 class I and class II. Class 1 enzymes include Metl and Metll from Arabidopsis 

(Finnegan et al Nucleic Acids Res., 2 1 ( 10):2383-2388 (1993); Nebendahl, et ah, Gene 
157(l-2):269-272 (1995)), Met 1-5 and Met2-21 from carrot (Bernacchia, G et al. 
Plant Phvsiol 1 16:446-446 (1998)), C-5 MTase from tomato (Bernacchia, G et al. 
Plant J., 13(3):317-330 (1998)), and C-5 MTase from pea (Pradhan et al, Nucleic 

20 Acids Res.. 26(5):1214-1222 (1998)). Class II sequences have been detected in many 
species with a defining charactenstic of the presence of an embedded chromodomain 
(Rose et al.. Nucleic Acids Res., 26(7); 1 628- 1 635 (1998)). The only full-length class 
II sequence is Cmtl from Arabidopsis (Genbank ??AF039364). 

25 Class I enzymes are homologous to dnmtl from mice (Bestor, T.. et al., 

EMBOJ., 1 i(7):261 1-2617 (1988)), the first cloned DNA methyltransferase. A 
knockout of dnmtl m mice resulted in lethality during embryogenesis (Li et ah, Cell, 
69(6):9 15-926 (1992)). Dnmtl has been used as a model for all class I enzymes 
though it has not been proven whether this is appropriate m plant systems. Antisense 

30 expression of Metl in Arabidopsis resulted in numerous developmental abnormalities 
(Finnegan et al.. Proc. Natl Acad. Scl U.S.A., 93(16):8449-8454 (1996)). Class I 
enzymes are thought to function as maintenance enzymes, though proteolytic 
cleavage could create de novo enzymes (Bestor, T.H., EMBOJ., 1 1(7):261 1-2617 
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(1992)). CpG activity has been shown for dnmtt m mice and humans. In peas it was 
found that pea C-5 MTase expressed m baculovirus displayed both CpG and CpNpG 
activity (Pradhan et ah, Nucleic Acids Res., 26(5): 1214-1222 (1998)). In general 
class I enzymes have a high level of expression in tissues that are actively dividing 
5 and are expressed at lower levels or silent in mature tissues. 

There is little known regarding the function of class II enzymes. Cmtl was 
detected as an Arabidopsis genomic sequence based on sequence homology to other 
methyltransferases. The C-termmal region contains the conserved methyltransferase 

10 domains and a chromodomain. The N-terminal region is much shorter than the N- 
termmal region of class I enzymes. Several commonly used ecotypes of Arabidopsis 
contain an allele of Cmtl which is interrupted by a transposon insertion. These Cmtl 
knockouts do not have any detectable phenotype. No other research has been 
published on the function of class II enzymes. Cmtl is expressed only m floral tissues 

15 at very low levels. Degenerate PCR has been used to show the presence of Cmtl 
homologs in a number of other plant species (Rose et ah, Nucleic Acids Res., 
26(7): 1628-1635 (1998)). In addition to finding homologs in other species, two 
sequences with similarity to Cmtl, Cmt2 and Cmt3, were identified in the 
Arabidopsis. 

20 

DNA methylation provides a mechanism for the mitotic propagation of 
epigenetic states. Epigenetic lineage-dependent patterns of gene expression have 
been studied the most m the germline and in somatic cell lineages m multicellular 
eukaryotes (Martienssen, R.A., et aL Curr. Opin. Genet, arid Develop,, 5:234-242 

25 (1995)). For example, in mice, the parentally imprinted genes HI 9 and Igflr are 

expressed in the embryo only when they are inherited via the female gamete. Id. In 
contrast, the Igf2 gene is expressed only when inherited via the male gamete. Id. The 
human homologs of the Ig/2 and HI 9 genes are linked and parentally imprinted as in 
the mouse. Id. Parental uniparental disomy for this chromosomal region (1 lpl5) is 

30 associated with Beckwith-Wiedemann syndrome, which is believed to result from 
overexpression of Igf2. Id. in addition to overgrowth of certain organs, Beckwith- 
Wiedemann syndrome patients have a 700-fold predisposition to Wilms' tumor, and 
loss of heterozygosity in this region is found in many other tumors as well. Id. It has 
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also been shown that 60-70% of Wilms' tumor patients have biallelic expression of 
lg/2> H t 9 * or both in tumor tissue, resulting from loss of imprinting rather than loss of 
heterozygosity- Id. 

5 In plants, epigenetic changes in gene expression are considered to be easier to 

observe than in animals since there is little cell migration and clonal lineages stay 
together. Id. Moreover, because m plants the germline arises relatively late m 
development, many somatically variegated phenotypes can be followed into the next 
generation and are heritable to greater or lesser extents. Id. Parental imprinting of 

10 gene expression was first observed in plants at the R locus in maize. Id. Certain 
alleles condition a mottled phenotype in the alerone layer of the extra-embryonic 
endosperm when inherited paternally, but cause a fully colored phenotype when 
inherited maternally. Id. Genetic studies of modifier loci have revealed that it is the 
maternally inherited R allele that is imprinted to a high level of expression. Id. High 

15 levels of R expression correlate with demethylation of sites in the transcribed region 
in the maternally inherited allele. Id. 

Plants transformed with additional copies of endogenous genes or with 
multiple copies of a foreign or exogenous gene (these endogenous and exogenous 

20 genes are often referred to as "transgenes") frequently display epigenetic inactivation. 
This phenomenon is known as "gene silencing" or "co-suppression". There are two 
types of "gene silencing" or "co-suppression". The first is "transcriptional 
silencing". In "transcriptional silencing", RNA production from the introduced 
transgene is repressed. The second type of "gene silencing" is "posttranscriptional 

25 silencing", in "posttranscriptional silencing", transcripts do not accumulate in the 
cytoplasm even though transcription rates are comparable with or are higher than 
those in cells where transcripts do accumulate. 

Transcriptional silencing is associated with transgene methylation, particularly 
30 in the promoter (Finnegan, E.J., et aL Annu. Rev. Plant Physiol. Plant Mol. Biol 

49:223-47 (1998)). Posttranscriptional silencing, which affects both transgenes and 
homologous endogeneous genes, is also associated with transgene methylation, but 
within the coding sequence rather than the promoter. Id. It is believed that both 
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forms of gene silencing reflect normal, cellular defenses against invading or mobile 
DNAs. Id. 

Currently, two classes of methyltransferase genes have been cloned m maize. 
5 The class I clone homolog is referred to as Zmetl and the class II homolog Zmet2. 
The Zmetl is a class I enzyme that was cloned by Paula Olhoft and Ron Phillips at the 
University of Minnesota. FIG. 4 is a summary of the major classes of 5-cytosme 
methyitransferases from mammals. Arabidposis and maize. The present invention 
herein relates to zmet2a and zmet2b methyitransferases. 

10 

SUMMARY OF THE INVENTION 
In one embodiment, the present invention relates to an isolated and purified 
lea mavs zmet2a methyltransferase nucleic acid sequence. Specifically, the isolated 
and purified Zea mays zmet2a methyltransferase nucleic acid sequence of the present 
15 invention hybridizes to the nucleic acid sequences shown in FIG. 1 A and IB under 
stringent conditions. The zmet2a methyltransferase nucleic acid sequence encodes 
the enzyme zmet2a methyltransferase. The amino acid sequences for zmet2a 
methyltransferase is shown in FIG. 2A and FIG. 2B. 

20 In another embodiment, the present invention further relates to recombinant 

expression cassettes comprising the isolated and purified zmet2a nucleic acid 
sequence described herein. Preferably, the recombinant expression cassettes further 
contain a promoter sequence and a polyadenylation signal sequence. The promoter 
sequence can be operabiy linked to the zmet2a nucleic acid sequence. The zmet2a 

IS nucleic acid sequence is operabiy linked to the polyadenylation signal sequence. Any 
promoter sequence can be used in the recombinant expression cassette, such as, but 
not limited to a constitutive or tissue specific promoter. 

In another embodiment, the present invention also relates to a recombinant 
0 expression cassettes comprising one or more heterologous nucleic acid sequences. 
Such recombinant expression cassettes further contain a promoter sequence from the 
zmet2a nucleic acid sequence and a polyadenylation signal sequence. The promoter 
sequence is operabiy linked to the heterologous nucleic acid sequence. The 
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heterologous nucieic acid sequence is operabiy linked to the polyadenylation signal 
sequence. Any heterologous promoter sequence can be used in this recombinant 
expression cassette. 

In a further embodiment, the present invention also relates to bacterial cells 
comprising at least one of the recombinant expression cassettes described herein. The 
bacterial cells can be Agrobactenum tumefaciens or Agrobacterium rhizogenes. 



In a further embodiment, the present invention further relates to transgenic 
10 plant cells and transgenic plants containing the recombinant expression cassettes 
described herein, Monocotyledonous or dicotyledonous plant cells and plants can be 
transformed with the hereinbefore described recombinant expression cassettes. Plants 
which can be transformed with the recombinant expression cassettes of the present 
invention include, but are not limited to, Zea mays, Ofyza sativa. Secale cereale, 
15 Triticum aestivum, Daucus carota, Brassica oleracea, Cucumis melo, Cucumis 
sativus, Latuca sativa, Solarium tabersoum, Lycopersicon esculentum, Phaseolus 
vulgaris, Brassica napus, etc. The present invention also relates to seed resulting 
from the transgenic plants of the present invention. 



20 In a further embodiment, the present invention further provides methods of 

reducing or altering methyltransferase activity m a transgenic plant in order to 
increase transgene expression stability and/or to improve the yield or biochemical 
qualities of a plant as well as a method of silencing targeted genes in a plant in vivo. 
These methods comprise introducing into a plant a recombinant expression cassette 

25 comprising an appropriate plant promoter operabiy linked to a zmet2a 

methyltransferase nucleic acid sequence described herein m either the sense or 
antisense direction. 



In a further embodiment, the present invention relates to an isolated and 
30 purified Zea mays zmet2b methyltransferase nucleic acid sequence. The zmet2b 

methyltransferase nucleic acid sequence of the present invention can be isolated using 
an isolated and purified partial Zea mays zmet2b methyltransferase nucleic acid 
sequence. The isolated and purified partial Zea mays zmeb2b methyltransferase 

6 
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nucleic acid sequence can be used as a probe to isolate the zmet2b methyltransferase 
nucleic acid encoding zmet2b methyltransferase. Preferably, the isolated and purified 
partial Zea mays zmet2b methyltransferase nucleic acid described herein hybridizes to 
FIG. 23 under stringent conditions. The partial zmet2b methyltransferase nucleic acid 

5 sequence described herein encodes a portion of zmet2b methyltransferase. The partial 
amino acid sequence of zmet2b methyltransferase is shown m FIG. 24. The zmet2b 
methyltransferase nucleic acid sequence can be used in recombinant expression 
cassettes in the same manner as the isolated and purified zmet2a nucleic acid 
sequence described herein. Such recombinant expression cassettes can be used to 

10 create transgenic plants containing these recombinant expression cassettes. 

Additionally, the zmet2b methyltransferase nucleic acid sequence can be used to 
reduce or alter methyltransferase activity in transgenic plants in the same manner as 
the zmet2a methyltransferase nucleic acid sequence. 

15 Definitions 

Units, prefixes, and symbols can be denoted in the SI accepted form. Numeric 
ranges are inclusive of the numbers defining the range. Unless otherwise indicated, 
nucleic acids are written left to right in 5 1 to 3^ orientation, respectively. The 
headings provided herein are not limitations of the various aspects or embodiments of 
20 the invention which can be had by reference to the specification as a whole. 
Accordingly, the terms defined immediately below are more fully defined by 
reference to the specification as a whole. 

As used herein, the term "plant" includes reference to whole plants, plant 
25 organs (e.g., leaves, stems, roots, etc.), seeds and plant cells and progeny thereof The 
class of plants which can be used in the methods of the present invention are generally 
as broad as the class of higher plants amenable to transformation techniques, 
including both monocotyledonous and dicotyledonous plants. 

30 As used herein, "heterologous" when used to describe nucleic acids or 

polypeptides refers to nucleic acids or polypeptides that originate from a foreign 
species, or, if from the same species, are substantially modified from their original 
form. For example, a promoter operably linked to a heterologous structural gene is 
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from a species different from that from which the structural gene was derived, or, if 
from the same species, one or both are substantially modified from their ongina] 
form. 

5 A nucleic acid or polypeptide is "exogenous to" an individual plant is one 

which is introduced into the plant by any means other than by a sexual cross. 
Examples of means by which this can be accomplished are described below, and 
include Agrobacterium-mediaXed transformation, biolistic methods, eiectroporation, 
and the like. Such a plant containing the exogenous nucleic acid is referred to herein 
10 as an Ri generation transgenic plant. Transgenic plants which arise from sexual cross 
or by selfing are descendants of such a plant. 

As used herein. * k zmet2a methyltransferase gene'' or "zmet2a 
methyltransferase nucleic acid" refers to a nucleic acid encoding zmet2a 

15 methyltransferase and which hybridizes under stringent conditions and/or has at least 
60% sequence identity at the deduced amino acid level to the exemplified sequences 
provided herein. The zmet2a polypeptide encoded by the zmet2a methyltransferase 
gene has at least 55% or 60% sequence identity, typically at least 65% sequence 
identity, preferably at least 70% sequence identity, often at least 75% sequence 

20 identity, more preferably at least 80% sequence identity, and most preferably at least 
90% sequence identity at the deduced amino acid level relative to the exemplary 
zmet2a methyltransferase sequences provided herein. 

As used herein, "zmet2a methyltransferase nucleic acid" includes reference to 
25 a contiguous sequence from a zmet2a methyltransferase gene of at least 2454 

nucleotides m length. In some embodiments the nucleic acid is preferably at least 
2736 nucleotides in length (see FIG. IA) and more preferably at least 2796 
nucleotides in length (see FIG. IB). 

30 As used herein. "zmet2b methyltransferase gene" or tv zmet2b 

methyltransferase nucleic acid" refers to a nucleic acid encoding zmet2b 
methyltransferase and which can be identified using the partial zmet2b 
methyltransferase nucleic acid shown m FIG. 23. The zmet2b methyltransferase gene 

8 
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hybridizes under stringent conditions to the partial zmet2b methyltransferase nucleic 
acid shown in FIG. 23. 

As used herein, Lt a partial zmet2b methyltransferase nucleic acid" includes 
reference to a contiguous sequence of at least 1181 nucleotides in length and which is 
from the zmet2b methyltransferase gene. 

As used herein, "isolated" includes reference to material which is substantially 
or essentially free from components which normally accompany or interact with it as 
found in its naturally occurring environment. The isolated material optionally 
comprises material not found with the material in its natural environment. 

As used herein, "nucleic acid" includes reference to a deoxyribonucleotide or 
ribonucleotide polymer m either single- or double-stranded form, and unless 
otherwise limited, encompasses known analogues of natural nucleotides that hybridize 
to nucleic acids in a manner similar to naturally occurring nucleotides. Unless 
otherwise indicated, a particular nucleic acid sequence includes the complementary 
sequence thereof. 

As used herein, "operably linked" includes reference to a functional linkage 
between a promoter and a second sequence, wherein the promoter sequence initiates 
and mediates transcription of the DNA sequence corresponding to the second 
sequence. Generally, operably linked means that the nucleic acid sequences being 
linked are contiguous and, where necessary to joint two protein coding regions, 
contiguous and in the same reading frame. 

In the expression of transgenes, one of ordinary skill in the art will recognize 
that the inserted nucleic acid sequence need not be identical and may be "substantially 
identical" to a sequence of the gene from which it was derived. As explained below, 
these variants are specifically covered by this term. 

In the case where the inserted nucleic acid sequence is transcribed and 
translated to produce a functional zmet2a and/or zmet2b methyltransferase 



WO 00/53732 



PCT/USOO/06456 



polypeptide, one of ordinary skill in the art will recognize that because of codon 
degeneracy a number of nucleic acid sequences w ill encode the same polypeptide. 
These variants are specifically covered by the term ' zmet2a methyltransferase nucleic 
acid sequence" or "zmet2b methyltransferase nucleic acid sequence". In addition, the 
5 term specifically includes those full length sequences substantially identical 

(determined as described below) with a zmet2a and/or zmet 2b methyltransferase 
gene sequence which encode proteins that retain the function of the zmet2a and/or 
zmet2b methyltransferase. Thus, m the case of the zmet2a and/or zmet2b 
methyltransferase genes described herein, the term includes variant nucleic acid 
10 sequences which have substantial identity with the sequences disclosed herein and 
which encode proteins capable of reducing or regulating DNA methylation in a 
transgenic plant for various purposes as well as silencing target genes in a plant using 
the nucleic acid sequences described herein. 

15 Two nucleic acids or polypeptides are said to be "identical" if the sequence of 

nucleotides or amino acid residues, respectively, in the two sequences is the same 
when aligned for maximum correspondence as described below. The term 
"complementary to" is used herein to mean that the complementary sequence is 
identical to all or a specified contiguous portion of a reference nucleic acid sequence. 

20 Sequence comparisons between two (or more) nucleic acids or polypeptides are 

typically performed by comparing sequences of two optimally aligned sequences over 
a segment or "comparison window" to identify and compare local regions of sequence 
similarity. Optimal alignment of sequences for comparison may be conducted by the 
local homology algorithm of Smith and Waterman, Ad. App. Math. 2: 482 (1981), by 

25 the homology alignment algorithm of Neddleman and Wunsch. J. MoL Biol 48:443 
(1970), by the search for similarity method of Pearson and Lipman. Proc. Natl Acad. 
Sci. (U.S.A.) 85:2444 (1988), by computerized implementation of these algorithms 
(GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software 
Package, Genetics Computer Group (hereinafter "GCG"), 575 Science Dr., Madison, 

30 WI), or by inspection. 

"Percentage of sequence identity" is determined by comparing two optimally 
aligned sequences over a comparison window, where the portion of the nucleic acid 
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sequence in the comparison window may comprise additions or deletions (i.e., gaps) 
as compared to the reference sequence (which does not comprise additions or 
deletions) for optimal alignment of the two sequences The percentage is calculated 
by determining the number of positions at which the identical nucleic acid base or 
5 amino acid residue occurs in both sequences to yield the number of matched 

positions, dividing the number of matched positions by the total number of positions 
in the window of comparison and multiplying the result by 100 to yield the percentage 
of sequence identity. 

10 The term "substantial identity" of nucleic acid sequences means that a nucleic 

acid comprises a sequence that has at least 55% or 60% sequence identity, generally 
at least 65%, preferably at least 70%, often at least 75%. more preferably at least 80% 
and most preferably at least 90%, compared to a reference sequence using the 
programs described above (preferably BESTFIT) using standard parameters. One of 

15 ordinary skill in the art will recognize that these values can be appropriately adjusted 
to determine corresponding identity of proteins encoded by two nucleotide sequences 
by taking into account codon degeneracy, amino acid sequences for those purposes 
normally means sequence identity of at least 55% or 60%, preferably at least 70%, 
more preferably at least 80%, and most preferably at least 95%. Polypeptides having 

20 "sequence similarity" share sequences as noted above except that residue positions 
which are not identical may differ by conservative amino acid changes. Conservative 
amino acid substitutions refer to the interchangeability of residues having similar side 
chains. For example, a group of amino acids having aliphatic side chains is glycine, 
alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic- 

25 hydroxy! side chains is serine and threonine; a group of amino acids having aromatic 
side chains is phenylalanine, tyrosine, and tryptophan; a group of amino acids having 
basic side chains is lysine, argmine, and histidme; and a group of ammo acids having 
sulfur-containing side chains is cysteine and methionine. Preferred conservative 
ammo acids substitution groups are: valine-leucine-isoleucine, phenylalanine- 

30 tyrosine, lysine-arginme, alanine-valine, and asparagme-glutamine. 



Another indication that nucleic acid sequences are substantially identical is if 
two molecules hybridize to each other under appropriate conditions. Appropriate 
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conditions can be high or low stringency and will be different in different 
circumstances. Generally, stringent conditions are selected to be about 5°C to about 
20°C lower than the thermal melting point (T m ) for the specific sequence at a defined 
ionic strength and pH. The T m is the temperature (under defined ionic strength and 
5 pH 0) at which 50% of the target sequence hybridizes to a perfectly matched probe. 
Typically, stringent wash conditions are those in which the salt concentration is about 
0.22 molar at pH 7 and the temperature is at least about 50°C. However, nucleic acids 
which do not hybridize to each other under stringent conditions are still substantially 
identical if the polypeptides which they encode are substantially identical. This may 
10 occur, e.g., when a copy of a nucleic acid is created using the maximum codon 
degeneracy permitted by the genetic code. 

Nucleic acids of the present invention can be identified from a cDNA or 
genomic library prepared according to standard procedures and the nucleic acids 

15 disclosed here used as a probe. For example, stringent hybridization conditions will 
typically include at least one low stringency wash using 03 molar salt (e.g., 2X SSC) 
at 65°C. The washes are preferably followed by one or more subsequent washes 
using 0.03 molar salt (e.g., 0.2X SSC) at 50°C, usually 60°C, or more usually 65°C. 
Nucleic acid probes used to isolate the nucleic acids are preferably at least 100 

20 nucleotides m length. 

As used herein, a homologue of a particular zmet2a and/or zmet2b 
methyltransferase gene is a second gene (either in the same species or in a different 
species) which encodes a protein having an amino acid sequence having at least 50% 
25 identity or 75% similarity to (determined as described above) to a polypeptide 
sequence in the first gene product. 

As used herein, "nucleotide binding site" or "nucleotide binding domain" 
includes reference to a region consisting of kinase-la, kinase 2, and kinase 3a motifs. 
30 which participates in ATP/GTP-binding. Such motifs are described for instance in Yu 
etaU Proc. Acad. ScL USA 93:1 175 1-11 756 (1996); Mindnnos. et al. Cell 78:1089- 
1099 and Shen et al, FEBS, 335:380-385 (1993). 
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As used herein, "tissue-specific promoter" includes reference to a promoter m 
which expression of an operably linked gene is limited to a particular tissue or tissues. 

As used herein "recombinant" includes reference to a cell, or nucleic acid, or 
5 vector, that has been modified by the introduction of a heterologous nucleic acid or 
the alteration of a native nucleic acid to a form not native to that cell, or that the cell is 
derived from a cell so modified. For example, recombinant cells express genes that 
are not found within the native (non-recombmant) form of the cell or express native 
genes that are otherwise abnormally expressed, under expressed or not expressed at 
10 all. 



As used herein, a "recombinant expression cassette" is a nucleic acid 
construct, generated recombmantly or synthetically, with a series of specified nucleic 
acid elements which permit transcription of a particular nucleic acid in a target cell. 
15 The expression vector can be part of a piasmid, virus, or nucleic acid fragment. 
Typically, the recombinant expression cassette portion of the expression vector 
includes a nucleic acid to be transcribed, and a promoter. 

As used herein, "transgenic plant" includes reference to a plant modified by 
20 introduction of a heterologous nucleic acid. Generally, the heterologous nucleic acid 
is a zmet2a and/or zmet2b methyltransferase structural or regulatory gene or 
subsequences or combinations thereof 



As used herein, "hybridization complex" includes reference to a duplex 
25 nucleic acid sequence formed by selective hybridization of two single-stranded 
nucleic acids with each other. 



As used herein, "amplified" includes reference to an increase in the molarity 
of a specified sequence. Amplification methods include the polymerase chain 
30 reaction (hereinafter "PCR"\ the ligase chain reaction (hereinafter "LCR"), the 
transcription-based amplification system (hereinafter "TAS"), the self-sustained 
sequence replication system (hereinafter "SSR"). A wide variety of cloning methods, 
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host cells, and in vitro amplification methodologies are well-known to persons of 
ordinary skill in the art. 



As used herein, "nucleic acid sample" includes reference to a specimen 
suspected of comprising a zmet2a and/or zmet2b methyltransferase gene. 



SEQUENCE LISTINGS 
The present application contains a number of nucleotide sequences and amino 
acid sequences. For the nucleotide sequences, the base pairs are represented by the 
10 following base codes: 





Svmbol 


Meanins 




A 


A; adenine 




C 


C: cytosine 


15 


G 


G; guanine 




T 


T; thymine 




U 


U: uracil 




M 


AorC 




R 


A or G 


20 


W 


A or T/U 




S 


C or G 




Svmbol 


Meaning 




Y 


Cor T/U 


25 


K 


G or T/U 




V 


A or C or G; not T/U 




H 


A or C or T/U; not G 




D 


A or G or T/U; not C 




B 


C or G or T/U; not A 


30 


N 


(A or C or G or T/U) 



The ammo acids shown in the application are in the L-form and are 
represented by the following ammo acid-three letter abbreviations: 



35 Abbreviation Amino acid name 

Ala L -Alanine 

Arg L-Arginine 

Asn L-Asparagme 

Asp L-Aspartic Acid 

40 Asx L-Aspartic Acid or Asparagine 

Cys L-Cysteine 

Glu L-Glutamic Acid 
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Gin 


L-Glutamme 




Glx 


L-Glutamine or Glutamic Acid 




Gly 


L-Glycme 




His 


L-Histidme 


5 


He 


L-Isoleucine 




Leu 


L-Leucme 




Lys 


L-Lysme 




Met 


L-Methionine 




Phe 


L-Phenylalanine 


0 


Pro 


L-Prolme 




Ser 


L-Senne 




Thr 


L-Threonme 




Trp 


L-Tryptophan 




Tyr 


L-Tyrosine 




Val 


L-Valme 




Xaa 


L-Unknown or other 



BRIEF DESCRIPTION OF THE DRAWINGS 



20 FIG. 1 A shows the nucleic acid sequence of the zmet2a methyltransferase 

gene containing 2736 basepairs. FIG. IB shows the nucleic acid sequence of the 
zmet2a methyltransferase gene containing 2796 basepairs. 

FIG. 2A shows the ammo acid sequence of the zmet2a methyltransferase 
25 containing 912 amino acids and which is encoded by the nucleic acid sequence shown 
in FIG. 1 A. FIG. 2B shows the amino acid sequence of the zmet2a methyltransferase 
containing 932 amino acids and which is encoded by the nucleic acid sequence shown 
in FIG. IB. 

30 FIG. 3 shows the PCR primers used to sequence the zmet2a methyltransferase 

gene. 

FIG. 4 is a summary of the major classes of 5-cytosme methyltransferases 
from mammals, Arabidopsis and maize. 

35 

FIG. 5 shows the genomic sequence of zmet2a methyltransferase gene and the 
retrotransposon SPRITE- 1 , along with intron-exon divisions, a restriction site map 
and a primer map. 
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FIG. 6 lists the World Wide Web sites used to process the sequence data for 
the zmet2a methyltransferase gene. 

5 FIG. 7 shows a Southern blot of B73 DNA digested with Hindlll and probed 

with clone CGET064. The Southern blot shows the presence of multiple copies of 
zmet2a or zmet2a-like genes in the B73 genome. DNA from B73 was digested with 
Hindlll and probed with clone CGET064 which does not contain a Hindlll site. The 
gene cloned and sequenced is represented by the upper band. 

10 

FIG. 8 shows the alignment of the amino acid sequence from zmet2a with the 
amino acid sequence of Arbadiopsis chromosmethylase CMTl (AF039367) and the 
C-terminal methylase domains from the DNA methyitransferases of maize zmetl 
(AF063403) and Arabidopsis MET1 (P3488I). Zmet2a shows similarity along the 

15 entire length of CMTl but significant similarity with zmetl and Metl exists only in 
the conserved motifs. Bold, uppercase, normal uppercase letters, and lower case 
letters indicate identity, conservation, and differences in amino acid sequences 
relative to zmet2a respectively. Dashes in the sequences are gaps introduced by 
CLUSTAL W to optimize the alignments. The location of the six conserved 

20 methyiase motifs are indicated in the sequence. The chromodomain is located 

upstream and adjacent to motif IV. The Mu insertion into the coding region of motif 
IX alters zmet2a function resulting in decreased methylation at CpNpG sites. Putative 
nuclear loalization signal peptides, NLS (N. Raikhei, Plant Physiol. 100, 1627 (1992)) 
are positioned in the N-terminal portion of the protein. 

25 

FIG. 9 lists the putative identification of zmet2a ammo acids involved in 
catalysis by comparison with ammo acids of M.Hhal with known catalytic functions. 
The amino acids of M.Hhal with catalytic functions were determined by 
crystalography by Cheng et ah. Cell 74:299-307 (1993). Ammo acid of zmet2a are 
30 numbered as in Figure 7. 

FIG. 10 shows southern analysis of repetitive DNA methylation patterns. 
Total genomic DNA (5 ug per lane) from an F 4 derived Fs family segregating for 
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zmet2a:M// was digested with isoschizomers Hpall and Mspl which recoginze the 
sequence CCGG. Digested DNA ws eiectophoresed through 0.8% agarose, 
transferred to nylon membrane, and hybridized with probes for repetitive DNA; the 
9kb 26s-5.8s-17s ribosomal repeat (FIG. 10A), 5s nbosomai repeat (FIG. 10B), and a 

5 centromenc repeat pSau3a9 (FIG. 10C). Decreased methyiation is observed in 
mutant plants (- -) relative to nonmutant plants (-r -f ) digested with Mspl which is 
sensitive to methyiation at me CpCpG sequnces. No changes in methyiation patterns at 
me CpG sits are observed in mutant plants as indicated by the lack of digestion with 
Hpall. Plants heterozygous for zmet2a:A/;// (- -) also show decreases at me CpCpg 

0 sites. 

FIG. 1 i shows gels from a Southern analysis which demonstrate that plants 
homozygous for zmet2a::Mul have decreased methyiation at CpNpG sites. More 
sites cut with restriction enzymes that are sensitive to methyiation at CpNpG sites in 
zrnetl&iMul plants. EcoRll is sensitive to methyiation at CC*A/TGG sites where * 
indicates the senitive cytosine (FIG. 1 1 A). Bglll is sensitive to methyiation at 
AGATC*T sites (FIG. 1 IB). Pstl is sensitive to methyiation at C*TGCAG sites 
(FIG. 1 1C). BamHl is sensitive to methyiation at GGATC*C sites (FIG. 1 ID). Avail 
is sensitive to methyiation at GGA/TC*C sites (FIG. 1 IE). Changes at CpG sites 
cannot be sparated from CpCpG m the Avail digests. DNA from the same plants as 
those in Figure 10 were digested and hybridized with the repetitive probes as 
described herein. 

FIG. 12 shows the cytosine methyiation levels in an F4 derived F5 segregating 
5 line for zrnet2a:;Mul . 5-methyicytosine content of DNA extracted from tissue of 
immature 5 lh -7 th leaves was determined by reverse phase HPLC using the method of 
Gehrike et al. Values were obtained from three wildtype plants, seven heterozygous 
plants and five homozygous plants. Two samples were run for each plant. 
Percentages of 5mC content [5mC/(5mC + C)] were calculated from concentrations 
0 determined from integration of peak and comparison to known standards. 

FIG. 13 shows gels from a Southern analysis which demonstrate that plants 
homozygous for zmet2a::Mui having a reduced level of methyiation that is stable 
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over generations. Two F 2 derived F 3 families homozygous for zmet2a:Mul, B5 and 
B6, were self pollinated to the F 6 generation. Two lineages from B5 and three 
lineages from B4 were grown at the University of Wisconsin. West Madison 
Agronomy Farm in 1999. Methylation levels are consistent across generations. Ono 
5 zmet2a:A/;// is in a homozygous state, methylation is reduced to a specific level and 
no further reductions occur. Dilution of methylation is not observed in each 
successive generation. DNA from leaf tissue was digested with Mspl and the 
Southern blot was hybridized with 9kb ribosomal repetitive probe. 

10 FIG. 14 shows gels from a Southern analysis winch demonstrate that 

methylation levels are restored to nonmutant parental levels in backcross progeny 
homozygous for wildtype zmet2a. An Fl hybrid of an F4 line homozygous for 
zmet2a::AM (lanes 1-3) and the inbred line Mol7 (lanes 4-6) was backcrossed to the 
nonmutant Mo 17 parent ot generate plants homozygous wildtype and plants 

15 heterozygous for zmet2a:A/«/. Fl plants (lanes 7-11) have methylation levels 

intermediate those of the parents. BC1 progeny heterozygous for zmet2a:Af«/ (lanes 
12-17) have methylation levels similar to the Fl. BC1 plants restored to wild-type 
zmet2a (lanes 18-21) have remethylation to levels comparable to the nonmutant 
parent line. Complete or near complete remethylation has occurred within one sexual 

20 generation. DNA was extracted from the 4 th - 6 th immature leaves of greenhouse 
grown seedlings, digested with Pstl which is sensitive to methylation at me CTGCAG 
sequences, and hybridized to the pSau3a9 centromeric repeat. 

FIG. 15 shows gels from a Southern analysis which demonstrate the 
25 expression of zmet2a in different tissues during development. Southern blots were 
produced with cDNA's synthesized from mRNA extracted from embryos 24 days 
after pollination (hereinafter tL DAP"), young leaves, immature ear, immature tassel, 
BMS callus, and 10 day old seedlings. Figure 15A shows the ethidium bromide 
stained gel. All lanes were loaded with 750 ng of cDNA except for the 10 day 
30 seedlings, of which 280 ng was loaded due to the limited amount available. The 

cDNA^s were quantified by spectrophotometry. The marker lane contains 800 ng of 
lambda DNA digested with Hindlll, Figure 15B shows the Southern blot hybridized 
with the zmet2a cDNA probe. Hybridization is observed in tissues that are actively 
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undergoing ceil division. Figure 15C shows the same blot hybridized to a ubiquitin 
probe to show cDNA loading variation. 

FIG. 16 shows the structure of maize retrotransposon SPRITE- 1 and sequence 
5 of Long Terminal Repeat (hereinafter vi LTR") components. FIG. 16A shows that 
SPRITE- 1 consists of long terminal direct repeats, a tRNA primer binding site 
(hereinafter "PBS"), coding sequence for proteins necessary for replication and 
transposition, and a polypurine tract (hereinafter "PPT"). FIG. 16B identifies the 
sequences for the 5* and 3' LTR , PBS and PPT. Each LTR has a 3 base pair inverted 
10 repeat which is also shown in the drawing. A putative TATA box is underlined and 
the putative transcription start site is italicized. The 5 base pair host insertion site 
duplications are also identified, 

FIG. 17 shows the alignments of the conserved protein motifs of the Tyl/copia 
15 elements with SPRITE- 1. The maize retrotransposon SPRITE- 1 is aligned with the 
retrotransposon hopscotch (U2626) from maizze, retrofit (U72725) from rice, an 
unpublished Arabidopsis retrotransposon (AC006528) and the copia element from 
Drosophila(M11240). 

20 FIG. 1 8 shows that the SPRITE- 1 copy number and insertion sites differ 

among maize inbred lines. DNA (7 jj.g) from inbred maize lines, barley, ice. rye, 
wheat and potato was digested with BcoRI which does not cut within the 
retroelement sequence. The Southern blot was hybridized with a 950 bp SPRITE- 1 
fragment which includes the 5' untranslated sequence and 5 1 sequence putatively 

25 coding for the gag protein but does not include the conserved gag motif or the 5' 
terminal repeat. 

FIG. 19 shows the identification of inbred lines containing a SPRITE- 1 
insertion in zmet2a. PCR was conducted on maize inbred lines from various origins 
30 using a primer upstream of the SPRITE- 1 insertion site 15F in conjuction with a 
SPRITE- 1 specific primer 1 8R or a zmet2a primer downstream of the element 8R. 
The upper panel (15F/I8R) show the inbreds that do not have a SPRITE- 1 insertion. 
The lower panel (15F/18R) shows that Moll 7 and A682 have a SPRITE- 1 insertion 
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into zmet2a. A682 has an amplification product from both primer sets indicating that 
it may be hemizygous for SPRITE- 1 . 

FIG. 20 shows expression of retroelement SPRITE- 1. Figure 20 A shows a 
5 Southern blot of cDNAs from roots, immature embryo 24 days after pollination 
(hereinafter, "DAP"), young leaf, young leaf with inacive zmet2a immature ear, 
immature tassel, mature pollen. Black Mexican Sweet (hereinafter, "BMS") callus, 
and 10 day seedling, hubridized with a SPRITE- 1 probe. Transcription of SPRITE- 1 
is evident as indicated by the hybridization to cDNA from embryo, and leaf tissue. 
10 Expression is highest in leaf tissue with significantly more expression being observed 
m leaf tissue from zmet2a:Mul plants that have decreased CpNpG methylation. FIG. 
20B shows the same Southern blot hybridized to a ubiquitin probe as a loading 
control. 

15 FIG. 21 shows that the presence of a SPRITE- 1 insertion into a zmet2a intron 

does not alter transcript splicing. Fragmetns spanning the SPRITE- 1 insertion and 
downstream from the insertion site were amplified by PCR from cDNA's. FIG. 21 A 
shows a scaled representation of zmet2a. Exons are represented by large blocks while 
the intervening introns are depicted by lines. The insertion of the retroelement is 

20 indicated above the zmet2a diagram. The element is inserted in the opposite 

orientation relative to zmet2a as indicated by the boxed arrows which represent the 
direct repeats. Positions of the primers used to generate fragments are indicated 
below the zmet2a diagram. Fragments were amplified from B73 (FIG. 21B) 
immature ear cDNA which does not contain the retroelement insertion and Mo 17 (M) 

25 embryo 24 days after pollination cDNA (FIG. 2 1 B) and Mo 17 (M) 10 day seedling 
cDNA (FIG. 21C). No differences were observed on the ethidium bromide stained 
gel of the PCR products. FIGS. 2 IB and 2 IC show hybridization of a near full length 
B73 cDNA probe to a Southern blot of the PCR fragments. 

30 FIG. 22 shows the methylation status of SPRITE- 1 . DNA from immature 

leaves was digested with methylation sensitive restriction enzymes. Southern blots 
were hybridized with a 970 base pair fragment from the 5' end of the untranslated 
region of SPRITE- 1. There are 5 BstNI/EcoRII sites, 1 Mspl/Hpall sites and 1 PstI 
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site within the sequence context of this probe. Nearly all sites are methylated in this 
region. 

FIG. 23 shows a partial nucleic acid sequence of the zmet2b methyltransferase 

5 gene. 

FIG. 24 shows a partial ammo acid sequence of the zmet2b methyltransferase 
encoded by the partial nucleic acid sequence shown in FIG. 23. 

]0 HG. 25 shows a comparison of a portion of the amino acid sequence for 

zmet2a methyltransferase with a portion of the ammo acid sequence for zmet2b 
methyltransferase. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
15 In one embodiment, the present invention relates to a zmet2a 

methyltransferase gene. The zmet2a methyltransferase gene of the present invention 
encodes a class II methyltransferase gene which controls CpNpG methylation. 
Nucleic acid sequences from the zmet2a methyltransferase gene described herein can 
be used to reduce or to alter the level of DNA methylation in a plant In addition, the 
20 zmet2a nucleic acid sequence described herein can be used to methylate a targeted 
gene in a plant in vivo to "silence" or "knock-out" said gene. 

In another embodiment, the present invention relates a zmet2b 
methyltransferase gene. The zmet2b methyltransferase gene can be isolated using a 

25 partial zmet2b methyltransferase gene described herein. Like the zmet2a 

methyltransferase gene, the zmet2b methyltransferase gene encodes a class II 
methyltransferase gene which controls CpNpG methylation. Nucleic acid sequences 
encoding the zmet2b methyltransferase gene can be used in the same manner as the 
nucleic acid sequence encoding the zmet2a methyltransferase gene to reduce or to 

30 alter the level of DNA methylation in a plant. In addition, the zmet2b nucleic acid 
sequence can be used to methylate a targeted gene in a plant in vivo to "silence 1 ' or 
"knock-out" said gene. 
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The present invention is applicable to a broad range of types of 
monocotyledonous and dicotyledonous plants, including, but not limited to, Zea mays, 
Oryza saliva, Secale cereale* Triticum aesrivum. Daucus carota, Brassica oleracea, 
Cucumis melo. Cucumis sativus, Latitca saliva. Solarium tubersoum, Lvcopersicon 
5 esculentwiu Phaseolus vulgaris, and Brassica napus. 

The nucleic acids of the present invention can be used in marker-aided 
selection. Marker-aided selection does not require the complete sequence of the gene 
or precise knowledge of which sequence confers which specificity. Instead, partial 

0 sequences can be used as hybridization probes or as the basis for oligonucleotide 

primers to amplify by PCR or other methods to follow the segregation of chromosome 
segments containing the zmet2a and/or zmet2b methyltransferase gene(s) in plants. 
Because ihe zmet2a or zmet2b methyltransferase marker is the gene itself, there can 
be negligible recombination between the marker and the methylated phenotype. 

5 Thus, the nucleic acids of the present invention can be used to provide an optimal 

means to DNA fingerprint class II DNA methyitransferases in other cultivars and wild 
germpiasm. This can be used to indicate if other germplasm accessions and cultivars 
carry the same zmet2a and/or zmet2b methyltransferase genes. 

0 Preparation of the Nucleic acids of the Present Invention 

Generally, the nomenclature and the laboratory procedures involved with 
recombinant DNA technology described below are those well known and commonly 
employed by those of ordinary skill in the art. Standard techniques are used for 
cloning, DNA and RNA isolation, amplification and purification. Generally, 

5 enzymatic reactions involving DNA ligase. DNA polymerase, restriction 
endonucleases and the like are performed according to the manufacturer's 
specifications. These techniques and various other techniques are generally 
performed according to Sambrook et aL, Molecular Cloning - A Laboratory Manual, 
Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1989). 



The isolation of zmet2a and/or zmet2b methyltransferase gene(s) can be 
accomplished via a number of techniques. For instance, oligonucleotide probes based 
on the sequences disclosed herein can be used to identify the desired gene m a cDNA 
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or genomic DNA library. To construct genomic libraries, large segments of genomic 
DNA are generated by random fragmentation, e.g. using restriction endonucleases, 
and are ligated with vector DNA to form concatemers that can be packaged into the 
appropriate vector. To prepare a cDNA library, mRNA is isolated from the desired 
5 organ of a particular plant, such as shoots from Zea mays, and a cDNA library which 
contains the zmet2a or zmet2b methyltransferase gene transcript is prepared from the 
mRNA. Alternatively cDNA may be prepared from mRNA extracted from other 
tissues in which the zmet2a or zmet2b methyltransferase gene or homologs are 
expressed. 

]0 

The cDNA or genomic library can then be screened using a probe based upon 
the sequence of a cloned zmet2a and/or zmet2b methyltransferase gene or partial 
sequence from either thereof (such as the partial zmet2b methyltransferase nucleic 
acid sequence shown in FIG. 23). Probes may be used to hybridize with genomic 
15 DNA or cDNA sequences to isolate homologous genes in the same or different plant 
species. 

Those of ordinary skill in the art will appreciate that various degrees of 
stringency of hybridization can be employed in the assay; and either the hybridization 

20 or the wash medium can be stringent. As the conditions for hybridization become 
more stringent, there is a greater degree of complementarity required between the 
probe and the target for duplex formation to occur. The degree of stringency can be 
controlled by temperature, ionic strength, pH and the presence of a partially 
denaturing solvent such as fermamide. For example, the stringency of hybridization 

25 is conveniently varied by changing the polarity of the reactant solution through 
manipulation of the concentration of formamide within the range of 0% to 50%. 

Alternatively, the nucleic acids of interest can be amplified from nucleic acid 
samples using amplification techniques. For instance, polymerase chain reaction 
30 (hereinafter "PCR") technology can be used to amplify the sequences of the zmet2a 
and/or zmet2b methyltransferase and related genes directly from genomic DNA, from 
cDNA, from genomic libraries or from cDNA libraries. PCR and other in vitro 
amplification methods may also be useful, for example, to clone nucleic acid 
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sequences that code for proteins to be expressed, to make nucleic acids to use as 
probes for detecting the presence of the desired mRNA in samples, for nucleic acid 
sequencing, or for other purposes. 

5 The degree of complementarity (sequence identity) required for detectable 

binding will vary in accordance with the stringency of the hybridization medium 
and/or wash medium. The degree of complementarity will optimally be 100 percent; 
however, it should be understood that minor sequence variations in the probes and 
primers may be compensated for by reducing the stringency of the hybridization 

10 and/or wash medium as described earlier. 

Appropriate primers and probes for identifying zmet2a and/or zmet2b 
methyltransferase nucleic acid sequences from plant tissues are generated from a 
comparison of the sequences provided herein. For a general overview of PCR see 
15 PCR Protocols: A Guide to Methods and Applications. (Innis, M. Gelfand, D., 

Snisky, J. and White, T.. eds). Academic Press, San Diego (1990), incorporated herein 
by reference. 

Nucleic acids may also be synthesized by well-known techniques as described 
20 in the technical literature. See e.g., Curruthers et ai, Cold Spring Harbor Symp. 

Quant. Biol 47:41 1-418 (1982), and Adams et ai, J. Am. Chem. Soc. 105:661 (1983). 
Double stranded DNA fragments may then be obtained either by synthesizing the 
complementary strand and annealing the strands together under appropriate 
conditions, or by adding the complementary strand using DNA polymerase with an 
25 appropriate primer sequence. 

Proteins of the Present Invention 

The present invention further provides for isolated zmet2a and/or zmet2b 
methyltransferases encoded by the zmet2a and/or zmet2b methyltransferase nucleic 
30 acids disclosed herein. One of ordinary skill in the art will recognize that nucleic 
acids encoding a functional zmet2a or zmet2b methyltransferase need not have a 
sequence identical to the exemplified genes disclosed herein. For example, because 
of codon degeneracy, a large number of nucleic acid sequences can encode the same 
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polypeptide. In addition, the polypeptides encoded by the zmet2a and/or zmet2b 
methyltransferase genes, like other proteins, have different domains which perform 
different functions. Specifically, zmet2a methyltransferase has ten (10) domains. 
These ten domains are identified as follows; I, chromodomain [32, chromodomam (33. 
5 IV, VI. VIII, IX and X. The ten domains and their sequence ranges (as shown in SEQ 
ID NO:2) are listed below in Table 1 : 

TABLE 1 

Domain Ammo Acid Sequence Range 



10 I 


244-27 1 


Chromodomain P2 


366-379 


Chromodomain P3 


380-388 


IV 


411-434 


VI 


456-476 


15 VIII 


496-520 


IX 


723-746 


X 


751-775 



Domains I and X are involved in binding AdoMet, which is source of the 
20 methyl group to be transferred during DNA methylation. Domain IV contains a 

catalytic domain. Domain VI aids in the positioning of domain IV. Domain VIII aids 
in DNA binding by neutralizing the charge of the phosphodiester backbone. The 
region between domain VIII and domain IX defines the sequence specificity of the 
zmet2a methyltransferase enzyme. Thus, the zmet2a methyltransferase gene 
25 sequences need not be full length, so long as the desired functional domain of the 
protein is expressed. 

The zmet2a methyltransferase protein is at least 912 ammo acid residues m 
length (see FIG. 2A), preferably, 932 ammo acid residues m length (see FIG. 2B). 
30 However, those of ordinary skill in the art will appreciate that amino acid deletions, 
substitutions, or additions to the zmet2a methyltransferase protein will typically yield 
a enzyme possessing methylating characteristics similar or identical to that of the full 
length sequence. Thus, full length zmet2a methyltransferase proteins modified by 1, 
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2, 3, 4, or ;> deietions. substitutions, or additions, generally provide an effective 
degree of methylation relative to the full-length protein. 

A partial ammo acid sequence of the zmet2b methyl transferase protein is 
5 provided for in FIG. 24 and is 256 ammo acids in length. 

Modified protein chains can also be readily designed utilizing various 
recombinant DNA techniques well known to those of ordinary skill in the art. For 
example, the chains can vary from the naturally occurring sequence at the primary 
10 structure level by amino acid substitutions, additions, deletions, and the like. 

Modification can also include swapping domains from the proteins of the present 
invention with related domains from other class II methyltransferases. 

The present invention also provides antibodies which specifically react with 
15 the zmet2a and/or zmet2b methyltransferase(s) of the present invention under 

immunologically reactive conditions. An antibody immunologically reactive with a 
particular antigen can be generated in vivo or by recombinant methods such as by 
selection of libraries of recombinant antibodies in phage or similar vectors. The term 
^immunologically reactive conditions" as used herein, includes reference to 
20 conditions which allow an antibody, generated to a particular epitope of an antigen, to 
bind to that epitope to a detectably greater degree than the antibody binds to 
substantially all other epitopes, generally at least two times above background 
binding, preferably at least five times above background. Immunologically reactive 
conditions are dependent upon the format of the antibody binding reaction and 
25 typically are those utilized m immunoassay protocols. 

The term "antibody' 1 as used herein, includes reference to an immunoglobulin 
molecule obtained by in vitro or vivo generation of the humoral response, and 
includes both polyclonal and monoclonal antibodies. The term also includes 
30 genetically engineered forms such as chimeric antibodies (e.g., humanized murine 
antibodies), heteroconjugate antibodies (e.g., bispecific antibodies), and recombinant 
single chain Fv fragments (scFv). The term "antibody" also includes antigen binding 
forms of antibodies (e.g., Fab ! , F(ab') 2 , Fab, Fv, and, inverted IgG. See, Pierce 
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Catalog and Handbook. 1994-1995 ) Pierce Chemical Co., Rockford. IL). An 
antibody immunologically reactive with a particular antigen can be generated in vivo 
or by recombinant methods such as selection of libraries of recombinant antibodies in 
phage or similar vectors {See, e.g. Huse et aL, (1989) Science 246:1275-1281; and 
5 Ward, et aL (1989) Nature 341:544-546; and Vaughan et al., (1996) Nature 
Biotechnology, 14:309-314). 

Many methods of making antibodies are known to persons of ordinary skill in 
the art. A number of immunogens are used to produce antibodies specifically reactive 
10 to the zmet2a and/or zmet2b methyltransferase(s) of the present invention under 

immunologically reactive conditions. An isolated recombinant, synthetic, or native 
zmet2a and/or zrnet2b methyltransferase(s) of the present invention is the preferred 
immunogens (antigen) for the production of monoclonal or polyclonal antibodies. 

15 The zmet2a and/or zmet2b methyltransferase(s) is then injected into an animal 

capable of producing antibodies. Either monoclonal or polyclonal antibodies can be 
generated for subsequent use in immunoassays to measure the presence and quantity 
of the zmet2a and/or zmet2b methyltransferases. Methods of producing monoclonal 
or polyclonal antibodies are known to those of skill in the art (See, Coligan (1991) 

20 Current Protocols in Immunology- Wiley/Greene, NY; and Harlow and Lane (1989) 
Antibodies: A Laboratory 7 Manual Cold Spring Harbor Press. NY); Goding (1986) 
Monoclonal Antibodies: Principles and Practice (2d ed.) Academic Press, New 
York. NY). 

Frequently, the zmet2a and/or zmet2b methyltransferase(s) and antibodies will 
25 be labeled by joining, either covalently or non-covaiently. a substance which provides 
for a detectable signal. A wide variety of labels and conjugation techniques are 
known and are reported extensively m both the scientific and patent literature. 
Suitable labels include radionucleotides. enzymes, substrates, cofactors, inhibitors, 
fluorescent moieties, chemiluminescent moieties, magnetic particles, and the like. 
30 Patents teaching the use of such labels include U.S. Patent Nos. 3,817,837: 3,850.752; 
3,939,350; 3.996,345; 4,277,437; 4,275,149; and 4.366.241. 
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The antibodies of the present invention can be used to screen plants for the 
expression of the zmet2a and/or zmet2b methyltransferase(s). The antibodies of the 
present invention are aiso used for affinity chromatography in isolating zmet2a 
and/zmet2b methyltransferase(s). 

5 

The present invention further provides zmet2a and/or zmet2b 
methyltransferase polypeptides that specifically bind, under immunologically reactive 
conditions, to an antibody generated against a defined immunogen, such as an 
immunogen consisting of the polypeptides of the present invention. For example, 

10 immunogens will generally be at least 912 contiguous amino acids from the zmet2a 
methyltransferase polypeptide of the present invention. Nucleic acids which encode 
such cross-reactive zmet2a and/or zmet2b methyltransferase polypeptides are also 
provided by the present invention. The zmet2a/zmet2b methyltransferase 
polypeptides can be isolated from any number of plants as discussed earlier. 

15 Preferred plants are Zea mays, Oryza sativa, Secale cereale. Triticum aestivum. 

Daucus carota, Brassica oleracea, Cucumis melo, Cucumis sativus, Latuca saliva, 
Solarium tubersoum, Lycopersicon esculentum, Phaseolus vulgaris, and Brassica 
napus. 

20 As used herein, the term, "specifically binds" includes reference to the 

preferential association of a ligand, in whole or part, with a particular target molecule 
(i.e.. "binding partner" or l 'binding moiety 1 ' relative to compositions lacking that 
target molecule). It is, of course, recognized that a certain degree of non-specific 
interaction may occur between a ligand and a non-target molecule. Nevertheless, 

25 specific binding, may be distinguished as mediated through specific recognition of the 
target molecule. Typically, specific binding results in a much stronger association 
between the ligand and the target molecule than between the ligand and non-target 
molecule. Specific binding by an antibody to a protein under such conditions requires 
an antibody that is selected for its specificity for a particular protein. The affinity 

30 constant of the antibody binding site for its cognate monovalent antigen is at least 10', 
usually at least 10 9 , more preferably at least 10 10 , and most preferably at least 10 11 
liters/mole. A variety of immunoassay formats are appropriate for selecting 
antibodies specifically reactive with a particular protein. For example, solid-phase 
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ELISA immunoassays are routinely used to select monoclonal antibodies specifically 
reactive with a protein (See Harlow and Lane (1988) Antibodies, A Laboratory* 
Manual, Cold Spring Harbor Publications. New York, for a description of 
immunoassay formats and conditions that can be used to determine specific 
5 reactivity). The antibody may be polyclonal but preferably is monoclonal. Generally, 
antibodies cross-reactive to zmet2a anchor zmet2b methyltransferases are removed by 
immunoabsorbtion. 

Immunoassays in the competitive binding format are typically used for cross- 
10 reactivity determinations. For example, an immunogenic zmet2a and/or zmet2b 

methyltransferase polypeptide is immobilized to a solid support. Polypeptides added 
to the assay compete with the binding of the antisera to the immobilized antigen. The 
ability of the above polypeptides to compete with the binding of the antisera to the 
immobilized zmet2a and/zmet2b methyltransferase polypeptides are compared to the 
15 immunogenic zmet2a and/or zmet2b methyltransferase polypeptide(s). The percent 
cross-reactivity for the above proteins is calculated, using standard calculations. 
Those antisera with less than 10% cross-reactivity with such proteins as zmet2a 
and/or zmet2b methyltransferase(s) are selected and pooled. The cross-reacting 
antibodies are then removed from the pooled antisera by immunoabsorbtion w r ith the 
20 non-zmet2a and/or non-zmet2b methyltransferase polypeptide(s). 

The immunoabsorbed and pooled antisera are then used in a competitive 
binding immunoassay to compare a second "target" polypeptide to the immunogenic 
polypeptide. In order to make this comparison, the two polypeptides are each assayed 

25 at a wide range of concentrations and the amount of each polypeptide required to 
inhibit 50% of the binding of the antisera to the immobilized protein is determined 
using standard techniques. If the amount of the target polypeptide required is less 
than twice the amount of the immunogenic polypeptide that is required, then the target 
polypeptide is said to specifically bind to an antibody generated to the immunogenic 

30 protein. As a final determination of specificity, the pooled antisera is fully 
immunoabsorbed with the immunogenic polypeptide until no binding to the 
polypeptide used in the immunoabsorbtion is detectable. The fully immunoabsorbed 
antisera is then tested for reactivity with the test polypeptide. If no reactivity is 
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observed, then the test polypeptide is specifically bound by the antisera elicited by the 
immunogenic protein. 

Production of Recombinant Expression Cassettes 
5 Isolated sequences prepared as described herein can then be used to provide 

recombinant expression cassettes. One of ordinary skill in the art will recognize that 
the nucleic acids used in the recombinant expression cassettes described herein 
encoding a functional zmet2a and/or zmet2b methyltransferase(s) need not have a 
sequence identical to the exemplified genes disclosed herein. In addition, the 
10 polypeptides encoded by the zmet2a and/or zmet2b methyltransferase genes, like 
other proteins, have different domains which perform different functions. Thus, the 
zmet2a and' or zmet2b methyltransferase gene sequences need not be full length, so 
long as the desired functional domain of the protein is expressed. 

15 A DNA sequence coding for the desired zmet2a and/or zmet2b 

methyltransferase polypeptide(s), for example a cDNA or a genomic sequence 
encoding a full length protein, can be used to construct a recombinant expression 
cassette which can be introduced into a desired plant. An expression cassette will 
typically comprise the zmet2a and/or zmet2b methyltransferase nucleic acid(s) 

20 operably linked in either the sense or antisense direction to transcriptional and 

translational initiation regulatory sequences which will direct the transcription of the 
sequence from the zmet2a and/or zmetZb methyltransferase gene(s) in the intended 
tissues for the transformed plant. 

25 For example, a plant promoter fragment may be employed which will direct 

expression of the zmet2a and/or zmet2b methyltransferase in all tissues of a 
regenerated plant. Such promoters are referred to herein as "constitutive" promoters 
and are active under most environmental conditions and states of development or cell 
differentiation. Examples of constitutive promoters includes the cauliflower mosaic 

30 virus (CaMV) 35S transcription initiation region, the V or 2' - promoter derived from 
T-DNA of Agrobactenum tumefaciens* and ubiquitin other transcription initiation 
regions from various plant genes known to those of ordinary skill in the art. 
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Alternative]), the plant promoter may direct expression of the zmet2a and/or 
zmet2b methyitransferase gene in a specific tissue or may be otherwise under more 
precise environmental or developmental control. Such promoters are referred to here 
as ''inducible" promoters. Examples of environmental conditions that may effect 
5 transcription by inducible promoters include pathogen attack, anaerobic conditions, or 
the presence of light. 

Examples of promoters under developmental control include promoters that 
initiate transcription only in certain tissues, such as leaves, roots, fruit, seeds, or 
0 flowers. The operation of a promoter may also vary depending on its location m the 
genome. Thus, an inducible promoter may be fully or partially constitutive in certain 
locations. 

The endogenous promoters from the zmet2a and/or zmet2b methyitransferase 
5 genes of the present invention can be used to direct expression of the genes. These 
promoters can also be used to direct expression of heterologous structural genes. The 
promoters can be used, for example, m recombinant expression cassettes to drive 
expression of genes to produce DNA methyitransferase in a particular cell or tissue. 

0 To identify the promoters, the 5 portions of the clones described herein are 

analyzed for sequences characteristic of promoter sequences. For instance, promoter 
sequence elements include the TATA box consensus sequence (TATAAT), which is 
usually 20 to 30 base pairs upstream of the transcription start site. In plants, further 
upstream from the TATA box, at positions -80 to -100, there is typically a promoter 

5 element with a series of adenines surrounding the trinucleotide G (or T) N G. J. 

Messing et al, in Genetic Engineering in Plants, pp. 221-227 (Kosage, Meredith and 
Hollaender, eds. 1983). 

If proper polypeptide expression is desired, a polyadenylation region at the 3 - 
0 end of the zmet2a or zmet2b methyitransferase coding region should be included. 
The polyadenylation region can be derived from the natural gene, from a variety of 
other plant genes, or from T-DNA. 
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The vector comprising the sequences from the zmet2a and/or zmet2b 
methyltransferase gene(s) will typically comprise a marker gene which confers a 
selectable phenotype on plant cells. For example, the marker may encode biocide 
resistance, particularly antibiotic resistance, such as resistance to kanamycin, G418, 
5 bleomycin, hygromycin, or herbicide resistance, such as resistance to chlorosulforon. 

As discussed above, the zmet2a and/or zmet2b methyltransferase geneisj can 
be inserted into a recombinant expression cassette in the antisense direction. 
Expression of the zmet2a and/or zmet2b methyltransferase gene(s) in antisense 

10 direction will result in the production of antisense RNA. As is well known, a cell 
manufactures protein by transcribing the DNA of the gene encoding a protein to 
produce RNA. which is then processed to messenger RNA (mRNA) (e.g., by the 
removal of mtrons) and finally translated by ribosomes into protein. This process 
may be inhibited in the cell by the presence of antisense RNA. The term antisense 

15 RNA means an RNA sequence which is complementary to a sequence of bases in the 
mRNA in question in the sense that each base (or the majority of bases) in the 
antisense sequence (read in the 3' to 5" sense) is capable of pairing with the 
corresponding base (G with C, A with U) in the mRNA sequence read in the 5 1 to 3' 
sense. It is believed that this inhibition takes place by formation of a complex 

20 between the two complementary strands of RNA, thus preventing the formation of 
protein. How this works is uncertain: the complex may interfere with further 
translation, or degrade the mRNA, or have more than one of these effects. This 
antisense RNA may be produced m the cell by transformation of the cell with an 
appropriate DNA construct designed to transcribe the non-template strand (as opposed 

25 to the template strand) of the relevant gene (or of a DNA sequence showing 
substantial homology therewith). 

The use of antisense RNA to downregulate the expression of specific plant 
genes is well known. Reduction of gene expression has led to a change in the 
30 phenotype of a plant, either at the level of gross visible phenotypic difference (e.g., 
lack of anthocyanin production in flower petals of petunia leading to colorless instead 
of colored petals (see van der Krol et aL, Nature, 333:866-869 (1988)), or at a more 
subtle biochemical level, for example, a change in the amount of polygalacturonase 
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and reduction in depolymerization of pectin during tomato fruit ripening (Smith et aL 
Nature, 334:724-726 (1988)). Another more recently described method of inhibiting 
gene expression in transgenic plants is the use of sense RNA transcribed from an 
exogenous template to downregulate the expression of specific plant genes 
5 (Jorgensen, Keystone Symposium "Improved Crop and Plant Products through 

Biotechnology", Abstract XI -022 (1994)). Thus, both antisense and sense RNA have 
been proven to be useful m achieving downregulation of gene expression in plants, 
which are encompassed by the present invention. 

JO Production of Transgenic Plants 

Techniques for transforming a wide variety of higher plant species using the 
recombinant expression cassettes hereinbefore described are well known and 
described in the technical and scientific literature. See. for example. Weising et al, 
Ann. Rev. Genet. 22:421-477 (1988). 

15 

The hereinbefore described recombinant expression cassettes may be 
introduced into the genome of a desired plant host by a variety of conventional 
techniques. For example, the DNA construct may be introduced directly into the 
genomic DNA of the plant cell using techniques such as electroporation, PEG 

20 poration, particle bombardment and microinjection of plant cell protoplasts or 

embryogenic callus, or the DNA constructs can be introduced directly to plant tissue 
using ballistic methods, such as DNA particle bombardment. In the alternative, the 
DNA constructs may be combined with suitable T-DNA flanking regions and 
introduced into a conventional Agrobacterium tumefaciens or Agrobacterium 

25 rhizogenes host vector. The virulence functions of the Agrobacterium host will direct 
the insertion of the construct and adjacent marker into the plant cell DNA when the 
cell is infected by the bacteria. 

Transformation techniques are known in the art and well described in the 
30 scientific and patent literature. The introduction of DNA constructs using 

polyethylene glycol precipitation is described in Paszkowski et al, EMBOJ. 3:2712- 
2722 (1984). Electroporation techniques are described in Fromm et al. r Proc. Natl. 
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Acad. Sci. USA 82:5824 (1985). Biohstic transformation techniques are described in 
Klein et aL. Nature 327:70-73 (1987). 

Agrobactehum tumefaciens-mt&idxzd transformation techniques are well 
5 described m the scientific literature. See. for example Horsch et aL, Science 233:496- 
498 (1984). and Fraley et aL. Proc. Natl. Acad. Sci. USA 80:4803 (1983). Although 
Agrobactenum is useful primarily in dicots, certain monocots can be transformed by 
Agrobactehum. For instance, Agrobactenum transformation of rice is described by 
Hiei etaL Plant J., 6:271-282 (1994). 

10 

Transformed plant cells which are derived by any of the above transformation 
techniques can be cultured to regenerate a whole plant which possesses the 
transformed genotype. Such regeneration techniques rely on manipulation of certain 
phytohormones m a tissue culture growth medium, typically relying on a biocide 

15 and/or herbicide marker which has been introduced together with the zmet2a and/or 
zmet2b methykransferase nucleotide sequence(s). Plant regeneration from cultured 
protoplasts is described in Evans et aL, Protoplasts Isolation and Culture, Handbook 
of Plant Cell Culture, pp. 124-176, MacMillian Publishing Company, New York, 
1983; and Binding; Regeneration of Plants, Plant Protoplasts, pp. 21-73, CRC Press, 

20 Boca Raton, 1985. Regeneration can also be obtained from plant callus, explants. 

organs, or parts thereof. Such regeneration techniques are described generally in Klee 
et aL, Ann. Ref of Plant Phys. 38:467-486 (1987) 

The methods of the present invention are particularly useful for incorporating 
25 the zmet2a and/or zmet2b methyltransferase nucleic acid(s) into transformed plants in 
ways and under circumstances which are not found naturally. In particular, the 
zmet2a and/or zmet2b methyltransferase(s) may be expressed at times or in quantities 
which are not characteristic of natural plants. 

30 One of ordinary skill in the art will recognize that after the expression cassette 

is stably incorporated in transgenic plants and confirmed to be operable, it can be 
introduced into other plants by sexual crossing. Any of a number of standard 
breeding techniques can be used, depending upon the species to be crossed. 
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The hereinbefore described expression cassettes can be inserted into a plant m 
order to reduce or alter the amount of DNA methylation in a plant. Preferably, such 
an expression cassette contains the zmet2a and/or zmet2b methyltransferase gene(s) 
inserted into the cassette m the antisense direction as described earlier. A reduction or 
alteration m the amount of DNA methylation m a plant can be used to stabilize 
transgene expression in a transgenic plant. 

One of the difficulties with the production of transgenic plants is that many 
transgenes are silenced or are not stable through successive generations. In many 
cases, transgene silencing is associated with increased DNA methylation. The 
hereinbefore described expression cassettes of the present invention containing the 
zmet2a and/or zmet2b methyltransferase gene(s) in the antisense direction can be 
inserted into a plant either before, concurrently with or after the insertion of another 
expression cassette containing a transgene which is to be expressed in the plant, such 
as, but not limited to. a resistance or drought tolerance gene, etc. The antisense RNA 
produced by the hereinbefore described expression cassette can then form a complex 
with the endogenous mRNA from the zmet2a and/zmet2b methyltransferase gene(s) 
within the plant. This complex should reduce or alter the amount of DNA 
methylation occurring in vivo in the plant. This reduction in DNA methylation should 
prevent the silencing of the desired transgene in the plant. 

In a similiar manner, the expression cassettes described herein can be used to 
modify or alter the yield or biochemical qualities of a plant. As discussed earlier, 
certain genes in plants and animals are expressed differentially when transmitted 
thorough a male versus female parent. This phenomenon is known as imprinting. 
Imprinting is an epigenetic system correlated with DNA methylation. A reduction or 
alteration of DNA methylation in a plant by transforming a plant with an expression 
cassette containing the zmet2a and/or zmet2b methyltransferase gene(s) in the 
antisense direction may affect the yield and biochemical qualities of a plant. 

The hereinbefore described expression cassettes can also be used to silence the 
expression of a particular targeted gene in plants in vivo. More specifically, the 
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expression cassettes of the present invention containing a tissue-specific promoter and 
the zmet2a and/or zmet2b methyltransferase gene(s) in the sense direction can be 
inserted into a plant. The tissue-specific promoter will direct expression of the 
zmet2a and/or zmet2b methyltransferase gene(s) in a area containing the desired 
5 targeted gene. Translation of the zmet2a and/or zmet2b methyltransferase gene(s) m 
the specific area will result in an increase in methylation in the area of the targeted 
gene. This increase m methylation can silence the targeted gene. 

Transgenic plants containing the expression cassettes described herein and 
0 which exhibit a reduction in DNA methylation can be identified by using methylation 
sensitive restriction enzymes or High Performance Liquid Chromatography. 
Techniques for using methylation sensitive restriction enzymes and High Performance 
Liquid Chromatography are well known in the art. Transgenic plants containing the 
expression cassettes described herein and which exhibit an increase in DNA 
5 methylation can be identified by using a Northern Blot analysis which is well known 
in the art. 

Additionally, the hereinbefore described expression cassettes can be used in 
gene therapy for human diseases which are caused by the amplification of 
0 trinucleotide repeats. 

The following Examples are offered by way of illustration, not limitation. 
EXAMPLES 

5 EXAMPLE 1 -Cloning and Sequencing of Zmet2a 
a. Cloning and Sequencing 

A partial cDNA clone (CGET064) from an immature tassel cDNA library was 

obtained from Pioneer Hi-Bred International (Des Moines, Iowa). This clone was 
identified in an expressed tag sequence (hereinafter "EST") database using known 
0 DNA methyltransferase sequences for comparison. This original cDNA clone 
contained sequences from bp 151 to bp 2569 shown m FIG. 1A and IB. The 
sequence of this clone, which represents the 3 f end of the transcript was used to design 
forward and reverse primers for 5 f and 3' Rapid Amplification of cDNA Ends 
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(hereinafter "RACE"). RACE was conducted using the Marathon cDNA 
Amplification Kit (available from Clontech) on cDNA prepared from Mo 1 7 10 day 
old seedling mRNA. Mo 17 is publically available from the National Seed Storage 
Lab (Fort Collins, Colorado). RACE products were isolated and ends sequenced 
5 using Marathon primers and gene specific primers. The remaining sequence was 

obtained from PCR products by primer walking. The primers used were AP2, IF, 1R, 
2R, 3R. 4F, 5F. 8R, 8F. 9R, 9F, 14F, 17F. and RaceRT (see FIG. 3). Two sequencing 
passes were made on the Mo 17 cDNA ends and four sequencing passes were made on 
the intervening regions, three from Mo 17 cDNA and one from B73. B73 is publically 
10 available from the National Seed Storage Lab (Fort Collins, Colorado). A consensus 
sequence for the coding region was generated and is shown m FIG. 1 A and IB. 

Genomic sequence spanning primers IF and 1 R were obtained from Pioneer 
Hi-Bred International. To obtain the remaining genomic sequence of zmetla, the 

15 CGET064 clone was used to probe a Mo 17 genomic library (Stratagene). Lambda 
clones 4a, 4c, 4di and 4d2 were determined to be positive clones containing a 
sequence identical to CGET064. Lamda clone 4a did not contain the full length gene, 
therefore, sequence data was obtained from clone 4c. No analysis of clones 4dl or 
4d2 was conducted. Clone 4c was subcloned into pGEM7zf(+) (Prome^a) using 

20 double digests involving Hindllh Xhol EcoRI, and BamHl. Genomic sequence was 
obtained from a combination of subclones pHX8 (bp 73 1 1-8878), pHX9 (bp 9173- 
10135), and pBl l(bp 5269-8447) and by primer walking using primers T7, Sp6, 
M13F, M13R, Seq2FN, Seq2RN, S3F. S3R, 7F, 8eR, 9F. 9R, 1 HR, 1 liF, 12iR, 12iF, 
13iR, 13iF, 14F, I4R, 15R, 15F, 16R, 16F, 17R, 17F, 18R, 18F, and RaceRT (see 

25 FIG. 3). Borders of the Mu insertion of zmet2a::MU 1 were sequenced from PCR 

products using primer 5F and a Mu primer (see FIG. 3). Map locations of the zmet2a 
primers are shown m FIG. 5. 

PCR products were sequenced using Big Dye terminator cycle sequencing on 
30 an ABI sequencer (Perkin-Elmer Applied Biosystems) at the University of Wisconsin 
Biotechnology Center Sequencing Facility (Madison, WI). Sequence data was 
processed using computional tools available through the World Wide Web 
(hereinafter. "WWW"), summarized in FIG. 6. 
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b. Mutant Analysis 

A mutant allele called (zmet2a;:Mul) was obtained from Pioneer Hi-Bred 
Internationars TUSC system. This mutant allele contains a Mutator transposable 
5 element insertion and was identified in a Mutator population using a Mu specific 
primer and a zmet2a gene specific primer. Since the Mutator population is quite 
variable, heterozygous zmet2a:;mul F : seed was advanced by selfmg at the 
University of Wisconsin West Madison Agronomy Farm (Madison. Wisconsin), the 
University of Wisconsin Walnut Street greenhouses (Madison, Wisconsin), and at the 
10 University of Wisconsin winter nursery in Puerto Rico to produce the F 4 derived F 5 
segregating family primarily used in this example. 

DNA from 15 plants of the F 4 derived F 5 segregating family was used for 
HPLC analysis. A subset of these plants was used for Southern analysis. The 5 th to 

15 7 [h immature leaf tips were collected and immediately frozen m dry ice. Tissue was 
ground in liquid nitrogen and DNA was extracted using a modified CTAB method of 
Saghai-Maroof et al. (Proc. Natl Acad. ScL USA 81:8014-8018 (1984)). Tissue was 
incubated m CTAB (Sigma) extraction buffer for 2 hours at 65 °C, extracted with 
chloroform/isoamyl alcohol, treated with 0.5 mg RNase A (Sigma) for 30 minutes at 

20 37 °C, extracted again with choroform/isoamyl alcohol, precipitated with 

isopropanoL washed with lOmM ammonium acetate/76% ethanol and resuspended in 
TE. 

Plants were genotyped by Southern analysis. DNA (10ug) was digested with 
25 BamUl and EcoRl which cut on each side of the Mu insertion. The digested DNA 
was eiectrophoresed through a 0.8% agarose 0.5X TBE gel. DNA was transferred to 
Immobillon nylon membrane (Miilipore) with 5X SSC. Blots were UV cross-linked 
for 25 seconds and dried at 80 °C for 1.5 hours. Pre-hybridization was carried out in 
5X SSC, 50mM Tns pH 8.0, 0.2% SDS, 10 mM EDTA, 2.5X Denhardts solution, and 
30 0.1 mg/ml single stranded sheared herring DNA overnight (8-16 hours) at 65 °C. 

Hybridization conditions were similar to pre-hybridization except for the addition of 
5% dextran sulfate to the hybridization solution. Probes (25-50 ng) (clone CGET064 
for genotypmg) were radioactively labeled using a random priming reaction 
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containing 50 jiCi of P-32 labeled dCTP. Following overnight hybridization at 65 °C. 
blots were washed 2X (0 15X SSC. 0.1% SDS) for 30-45 minutes at 65 °C. 
Hybridized blots were then exposed to Kodak Biomax film, 

5 Southern analysis with methylation sensitive restriction enzymes was 

conducted in a similar manner except that 5 ug of DNA was digested. Enzymes 
included in the study were; Apal Avail Bamiil Bglll BstNL ClaL EcoO\09. EcoRI, 
EcoRlh Haellh Hinft. Hhah Hpalh Mspl PstL PvulL SacL Saw3a, Scr¥\, SmaL 
XIiol. Probes for repetitive sequence regions of the maize genome including a 9 kb 

10 clone for the maize 26s-5.8s-l 7s repeat (reviewed in McMullen et aL Molecular 

Analysis of the Nucleolus Organizer Region in Maize. In. Chromosome Engineering 
in Plants: Genetics. Breeding, and Evoluation. Gupta PK. Tsuchiya T. (eds). pp. 561- 
576 (1991)), the 5s nbosomal subunit clone (Mascia et aL. Gene, 15:7-20 (1981)), and 
centromere probe pSau3a9 (Jiang et aL. Proc. Natl. Acad. Sci. USA 93:14210-14213 

15 (1996)) were used to analyze changes in methylation due to zmet2a::MuL 

HPLC was conducted according to a modified protocol of Gehrke et aL, {J 
Chromat. 301:1 99-2 19 (1 984)). Duplicate preparations for each of fifteen plants were 
analyzed. Twenty-five micrograms of DNA was diluted with water to a volume of 50 

20 jul, denatured at 96 °C for 5 minutes and immediately placed on ice. One hundred 
microliters of 30mM ammonium acetate (pH 5.3), 5 ul of 20mM Zinc Sulfate and 10 
ul Nuclease PI (lmg/ml in 30mM ammonium acetate (pH 53) was added and 
incubated at 37 °C for 2 hours. This reaction cleaves 5 1 mononucleotides from single 
stranded DNA. The pH was adjusted with 20 \x\ of Tris (pH 8.5) and approximately 

25 15 units of Calf Intestinal Alkaline Phosphatase was added and incubated at 37 °C for 
an additional 2 hours which converts the nucleotides to nucleosides. Samples were 
frozen at -20 °C until HPLC analysis. 

HPLC analysis was conducted at the University of Wisconsin Biotechnology 
30 Center. A volume of 50 |il was injected into a Brownlee Lab Spheri-5 RP-8 column. 
Nucleosides were separated with a flow rate of 0.75 ml/mm using a gradient program 
consisting of 30 minutes m buffer A (0.05M Potassium Phosphate pH 4.0, 2.5% 
methanol), 19 minutes m buffer B (0.05M Potassium Phosphate pH 4.0, 20% 
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methanol). The column was flushed with 70% methanol for 13 minutes and then re- 
equilibrated with buffer A for 23 minutes before the injection of the next sample. All 
samples were analyzed on a Beckman System Gold chromatograph and nucleosides 
detected at A260nm and A280nm. Nucleoside and nucleotide standards (Sigma) were 
5 used to determine nucleoside peak positions and to create a standard curve to 

determine nucleoside concentration. The ratio of 5-methylcytosme to total cytosme 
was calculated and statistical analysis conducted using SAS. 

To test remethylation as an indication of de novo methylase activity, an Fi 
10 hybrid of an F 4 line homozygous for zmet2a::Mul and the inbred line Mol 7 was 

backcrossed to the nonmutant Mol 7 parent to generate plants homozygous wiid-type 
and plants heterozygous for zmet2a::MuL Seedlings of the Fi, the BC] progeny, the 
Mo 17 parent and a sib of the F 4 zmet2a::Mul parent were grown m the greenhouse 
and DNA extraction and Southern analysis conducted as previously described. DNA 
15 was digested with Mspl and Pstl and probed with the aforementioned repetitive 
clones. 



c. Expression Analysis 

The expression of zmet2a was determined by hybridizing the zmet2a cDNA 
20 probe to a Southern biot of cDNA's prepared from different tissues and tissues at 
different stages of development. Tissues included m this study are embryos 24 days 
after pollination, 10 day seedlings, immature ear, immature tasseL immature leaf from 
mutant and nonmutant plants, and roots. Total RNA was extracted using Trizoi 
(Gibco/BRL) according to the manufacture's protocol. The Poly Attract System 
25 (Promega) was used to isolate mRNA's from all tissues except 10 day seedlings 
which was isolated using oligo dT cellulose columns (Pharmacia). cDNA was 
synthesized from the isolated RNA's using Marathon cDNA Amplification Kit 
(Clontech). 



30 d. Results 

zmet2a shares sequence similarity with other DNA methyltransferases 

zmet2a is a member of a small gene family. Three cohybridizmg bands are 
observed on a Southern blot of B73 DNA digested with Hindlll and probed with 
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cione CGET064 which does not contain a Hindlll restriction site (see FIG. 7). 
zmet2a. which maps to the iong arm of chromosome 10, is coded on 20 exons with 19 
intervening introns (FIG. 5). The inferred protein using the first predicted translation 
start site located within a consensus Kozak sequence (Kozak, J, Cell BioL, 1 1 5:887- 
5 903 ( 1991)) is composed of at least 912 ammo acids with a predicted mass of 101 Kd 
(Kilodaltons). A protein of this size with an affinity for CpNpG sequences was 
isolated in Pisum sativum by Pradhan and Adams (Plam J., 471-481 (1995)). 

Comparisons with Arabidopsis chromomethylase, CMT1 

10 Sequence of zniet2a (FIG. 1 A and IB) reveals that it lacks the large N- 

terminal domain found in the maintenance enzymes but does possess the six highly 
conserved motifs of the C-terminal catalytic domain. Database searches using 
BLAST (http://www.ncbi.nlm.nih.gov/gov/BLAST/) show that zmet2a has highest 
sequence homology to the Arabidopsis chromomethylase, CMT1 (see Henikoff and 

15 Comai, Genetics, 148:307-318 (1998)) with 44% identity, 57% conservation. The N- 
terminal region is larger in zmet2a; however, there is an additional downstream 
predicted start site, also within a consensus Kozak sequence, that codes for an enzyme 
of 809 amino acids which is more similar in size to the most closely related CMT1 
which is composed of 791 amino acids. 

20 

Alignments of zmet2a with CMT1 and the catalytic domains of Arabidopsis 
MET1 and maize zmetl maintenance enzymes show conservation in the important 
functional motifs I, IV, VI, VIIL IX and X providing evidence that it is indeed a 
DNA methyltransferase (FIG. 8). zmet2a and CMT1 are 87% conserved across the 

25 defined six conserved domains, as shown in the underlining in FIG. 8. Zmet2a and 
CMT1 also have 60% conservation in the variable region sequence between the 
defined underlined motifs VIII and IX in FIG. 8, which contains a region known as 
the target recognition domain in the bacterial methyltransferases. The bacterial 
methylase M.Hhal has been crystalized and functions deduced for the conserved 

30 ammo acids (Cheng et aL Cell 74:299-307 (1993)). The zmet2a ammo acids 

involved in catalysis were predicted by comparison to M.HhaL The ammo acids 
interacting with SAM and with cytosine are summarized m FIG. 9. 
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zmet2a mutant plants have reduced methylation at CpNpG sites 

A reverse genetics approach was used to ascertain the function of zmet2a. A 
F2 family segregating for a Mutator (Ma) insertion in the exon encoding motif IX was 
identified using a PCR primer for Mu and a gene-specific primer for zmet2a. This 
5 allele is called zmet2a::Mul . The insertion of Mu into exon 19 results m a transcript 
that would code for a protein truncated at the point of the Mu insertion in motif IX 
due to the introduction of a stop codon. The resulting protein is expected to be 
dysfunctional since it lacks Motif X which is required for S-Adenosyl methionine 
(hereinafter "SAM") binding (Cheng et al. Cell 74:299-307 (1993)). 

10 

Reduced methylation observed by restriction enzyme analysis 

To reduce the genetic background variation associated with the heterogeneous 
origin of the Mutator population, restriction enzyme analysis was conducted on a F 4 
derived F5 family segregating for zmet2a::Mul. Restriction enzyme isoschizomers 

15 Hpall/Mspl m addition to other methylation sensitive enzymes were used to 

determine methylation pattern differences among the three genotypic classes. Hpall 
and Mspl both recognize the sequence CCGG but differ m their sensitivity to 
methylation. Hpall digestion is inhibited unless both cytosines are unmethylated 
whereas Mspl can digest C me CGG sequences but not me CCGG sites. The methylation 

20 status at CpG sites can be accessed by digesting with Hpall and similarly Mspl 

digestion is used to determine the state of methylation at CpCpG sites specifically and 
may provide a general indication of methylation changes occurring at CpNpG sites. 

Results indicate significant reductions in cytosme methylation at me CCG sites 
25 as indicated by a more complete digestion by Mspl in plants homozygous for 

zmet2a::Mul (FIG. 10A-C). Plants heterozygous for zmet2a::Mul were intermediate 
m their digestion pattern. Although the frequency of methylated cytosines is much 
higher at CpG sequences, no changes in methylation were observed among the 
genotypic classes when digested with Hpall (FIG. 10 A-C). 

30 

Isoschizomers. BstNl and £coRII recognize the sequence CC(A/T)GG. BstNl 
is not sensitive to cytosine methylation and EcoRll is inhibited at C me C(A/T)GG sites. 
Nearly all of these sites are methylated in repetitive sequences as a low level of 
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EcoRll digestion is observed only in zmet2a;:Mul plants (See FIG. 1 1), whereas 
digests with BsfN] are completely digested to lower molecular weight fragments for 
all genotypes. These methylated sites may not be subject to zmet2a activity but may 
instead be methylated by another member of the zmella gene family or by zmetl or 
5 possibly de novo methylated after each cell cycle by zmet3. Other restriction enzymes 
were used to clarify the apparent sequence specificity of rnethylation reduction at 
CpNpG sites. As with the isoschizomers, no digestion differences are observed with 
CpG sensitive enzymes Hhal [G me CGC] and CM [AT me CGAT], More complete 
digestion is observed m plants homozygous for zmet2a::Mul with enzymes sensitive 

10 to rnethylation at CpNpG sites. FIG. 12 shows digestion patterns for enzymes 

sensitive to rnethylation at CpNpG sites: EcoRll, Bglll, Pstl, BamHl, and Avail. In 
addition to EcoRll as previously mentioned, reduced rnethylation in one or more of 
the repetitive regions was observed with Bglll [AGAT me CT], Pstl [ me CTGCAG], 
Bamtil [GGAT me CC], and Avail [GG(A,T) me C me C]. It should be noted thai Avail 

\5 may include some CpG overlapping sites. Subtle differences in digestion patterns of 
one or more of the repetitive sequences were also observed with SauSal [GAT me C], 
Apal [GGG me CC me C], and Xhol [ me CT me CGAG]. With these enzymes it is not 
possible to unambiguously determine whether the source of the difference is CpG or 
CpNpG rnethylation. Differences were also observed with ScrFl [C me CNGG] which 

20 duplicates the targeted sequences and rnethylation sensitivities of EcoRIL Msp\ and 
Hpalh Although in many cases the observed reduction m CpNpG or CpN 
rnethylation is minimal, any cases of reduced rnethylation that could be 
unambiguously attributed to CpG sites have not been observed. 

25 Reduced rnethylation observed by HPLC 

To further assess the extent of rnethylation reduction caused by the 

zmet2a::Mul allele, HPLC was used to determine the proportion of methylated 

cytosines in the same F5 plants used for restriction enzyme analysis. An 1 1.6% 

decrease in 5-methylcytosine was observed in plants homozygous for zmet2a::Mul 

30 relative to siblings homozygous for wild-type zmet2a (FIG. 12). Heterozygotes were 

intermediate in 5-methylcytosine content. Differences between the genotypic classes 

are statistically significant at a< 0.0001. Since most methyiation is found at CpG 

sites (Gruenbaum et aL, Nature, 292:860-862 (1981)), a 12% decrease m the total 5- 
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methylcytosine content likely accounts for a substantial reduction in methylation at 
CpNpG sites if the reductions are confined to these sequences. 

Several generations of inbreeding does not reduce methylation levels beyond 
5 that which is observed in the F 2 homozygous mutant (FIG. 13). In addition, it was 
also observed that plants restored to a normal zmet2a genotype from zmet2a::Mul 
heterozygotes appeared to have near normal levels of methylation. 

Methylation is restored after segregation away from zmet2a;:Mul 

10 To test remethylation. a nonmutant line. Mo 17. was crossed to a homozygous 

mutant line, the resulting Fi was then backcrossed to the nonmutant Mo 17 parent line. 
Restriction enzyme analysis of backcross progeny show all individuals without the 
Mu insertion have remethyiated to levels similar to the backcross parent (see FIG. 14). 
The increased levels of methylation observed in normal BC t progeny appear to be 

15 higher than that expected from the segregation of normal Mol 7 derived chromosome 
segments and low methylation mutant segments, which would result in a pattern 
intermediate between the Fi and the nonmutant parent. These results indicate either 
that zmet2a has in vivo de novo activity and is responsible for establishing CpNpG 
methylation patterns, or that a separate de novo methyltransferase functions only early 

20 in development and that zmet2a is responsible for maintaining these patterns. These 
results on remethylation are m contrast to those of the reduced methlyation patterns of 
Arabidopsis mutants. Backcross progeny, lacking an antisense MET1 transgene or 
the ddml mutation, derived from mutant plants outcrossed to normal plants showed 
very slow remethylation and required several generations to restore methylation to 

25 normal levels (Ronemus et al. Science, 273:654-657 (1996), Vongs et aL, Science, 
260:1926-1928 (1993), Kakutani et aL Genetics, 151:831-838 (1999)). Similar 
results were observed in selfed progeny from hemizygous antisense Metl plants that 
did not inherit the transgene (Fmnegan et aL. Proa Natl. Acad. Scl USA 93:8449- 
8454 (1996)) however a centromeric region and some single copy sites did 

30 remethylate in the first generation (Fmnegan et aL, Annu. Rev. Plant Physiol Plant 
Mol Bio.. 49:223-247 (1998)). 



44 



WO 00/53732 



PCT/US00/06456 



Other DNA methyltransferases that lack the large N-terminai domain have 
been presumed to be de novo enzymes, however, evidence remains insufficient. In 
vitro expression of Dnmr3a and Dnmt3b (Okano et aL Nature Genetics, 19:219-220 
(1998)) did not show a specific preference for hemimethyiated DNA or 
5 nonmethylated DNA and in vivo expression m Drosophila (Lyko et aL Nature 

Genet.* 23:363-366 (1999)) further confirm de novo activity, whereas Dnmtl (Okano 
et ah. Nucleic Acids ites., 26:2536-2540 (1998)) was shown not to effect de novo or 
maintenance methylation in mice. MascL m ascobo/us. is purported to have de novo 
activity through its effect on methylation induced premeiotically (MIP) (Malagnac et 
10 aL Cell 91:281-290 (1997)). Another Ascoboius methyltransferase Masc2 was 

found to be dispensible for maintenance and de novo methylation in vivo (Malagnac 
et zlAIol. Micro, 3:331-338 (1999)). 

A chromodomain is present in zmet2a 

15 A distinguishing feature of zmet2a, like CMTL is the presence of the 

chromodomain. Chromodomains have been demonstrated to target proteins to 
heterochromatic regions and may also be a site of protein-protein interactions 
(reviewed by Cavalh and Paro. Curr. Op. Cell Biol, 10:354-360 (1998)). The 
presence of the chromodomain in zmet2a and CMT1 potentially suggests targeting of 

20 the methyltransferase to chromatin complexes or a role of the methyltransferase in 
chromatin formation and stability. Furthermore, the observation that zmet2a affects 
CpXpG methylation may also implicate protein targeting through the chromodomain 
and targeting of methylation patterns. Stable transcriptionally active or silent states 
may be determined by the formation of chromatin complexes. The mechanisms 

25 involved in the formation of silencing complexes remain unknown. However, there is 
evidence of the involvement of methylation in transcriptionally silenced states which 
involve methylation binding proteins, transcriptional repressor complexes, and histone 
deacetylases (Nan et aL, Nature, 393:386-389 (1998), Wade et al. Nature Gen.. 23:62- 
66 (1999), Ng et aL Nature. Gen. 23:58-61 (1999)). 

30 

zmet2a is expressed throughout plant development. Expression is higher in 
the rapidly dividing tissues of seedling, immature ear and embryos (FIG, 15) 
consistent with the role of methyltransferases m methylating newly synthesized DNA. 
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Low expression of zmet2a in terminal tissue (leaves) could serve a protective function 
against invading DNA if this enzyme does have a de novo function. 

Example 2 - Cloning and Sequencing of the maize retrotransposon SPRITE-1 

5 This example describes the cloning and sequencing of a maize retrotransposon 

that is inserted into an intron of zmet2a and is referred to herein as ''SPRITE- 1". 

a. introduction 

Within the genomes of most organisms are DNA elements that can be 
10 considered parasitic. These elements confer no phenotype of their own and function 
only for their propagation and insertion elsewhere m the genome. There are two 
major classes of these elements based on the mechanisms of propagation. One class 
propagates using DNA-mediated mechanisms where the element does not code for 
any polymerase and entirely depends on the replication machinery of the host. This 
15 class includes the Ac, Spm, and Mu transposable element systems. The other major 
class is known as retrotransposons, retrotransposable elements or retroelements 
(reviewed in Grandbastien, Trends in Genetics 8:103-108 (1992); Eickbush, Origin 
and Evolunonaiy Relationships of Retroelements. In The Evolutionary Biology of 
Viruses (Morse, S.S.. ed).) (1994); Wessier et ah, Current Biology?, 5:814-821 (1995); 
20 Bennetzen, Genome. 37:565-576 (1996)). These elements are not able to excise from 
one site and insert into another, as the previously mentioned class is capable, but 
replicate by an RNA-mediated process. The retroelements code for a reverse 
transcriptase which is a DNA polymerase that uses RNA as a template. 

25 There are several types of retroelements. The main types are retroviruses, 

long-terminal-repeat (hereinafter "LTR") retroelements, and non-LTR retroelements. 
Retroviruses are infectious and have not been found in plants, although one plant 
LTR-retroelement, SIRE-1 from soybean has coding sequences similar to that of a 
retoviral envelope protein (Laten et aL, Proc. Natl Acad. ScL, 95:6897-6902 (1998)). 

30 The non-LTR class is mainly composed of long interspersed nuclear elements 

(hereinafter "LINEs") and short interspersed nuclear elements (hereinafter w< SINEs"). 
These elements have been found in plants. Less is known about this class than the 
others. They do differ from LTR-retroelements in that they contain a poiy-A tail at 
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their 3' end. The LTR-retroelement class has been more extensively described in 
plants than the other classes of retroelements. The LTR-retroelements are usually 
categorized as one of two groups based on the similarity with the first elements 
described in yeast and Drosophila. One group shares similarity with the Ty3 
5 elements from yeast and the gypsy element of Drosophila (Marlor et aL, MoL Cell. 
Biol., 22:829-846 (1986); Clark et aL. J. Biol Chem. % 263:1413-23 (1988)). The other 
group has similarity with the Tyl elements of yeast and the copia element of 
Drosophila. The element identified in this study is of the Tyl /copia class (Clare and 
Farabaugh, Proc. Natl Acad. Sci. USA, 82:2829-2833 (1985); Mount and Rubin. MoL 
10 Cell. Biol 5:1630-1638 (1985)). 

The general structure of a LTR-retroelement is depicted in FIG. 16A. These 
elements are similar in their structure and replication to retroviruses (reviewed in 
Witcomb and Hughes, Ann. Rev. Cell Biol., 8:275-306 (1992), Eickbush, Origin and 

1 5 Evolutionary Relationships of Retroelements. In The Evolutionary Biology- 7 of Viruses 
(Morse, S.S., ed.)- New York: Raven Press, pp 121-157 (1994), Bennetzen, Trends in 
Microbiology, 9:347-353 (1996)), These elements have direct repeats at the termini 
as opposed to the DNA based elements that have inverted terminal repeats. 
Downstream from the 5 J LTR is a primer binding site for a host tRNA that pnmes the 

20 first DNA strand synthesis using reverse transcriptase. One or more open reading 

frames that code for gag^ a protease, an integrase, a reverse transcriptase, and RNaseH 
are located downstream from the primer binding site. After the coding region is a 
poiypurme tract followed by the 3' LTR. Ty3/ gypsy and Tyl! copia elements differ in 
the postion of the integrase coding region. Ty3/gypsy element have the integrase 

25 domain at the end of the coding region whereas Tyl /copia element have it positioned 
between the proteinase and reverse transcriptase regions. The gag gene encodes 
proteins for the nucleocapsid and the highly conserved cysteine-histidme nucleic acid 
binding domain (CX2CX4HX4C). The protease processes the polyprotein into its 
individual components. The integrase functions to insert a newly repiicated element 

30 into the host DNA. The reverse transcriptase synthesizes the first DNA strand from 
the transcribed RNA of the element. The RNase degrades the RNA following first 
strand synthesis. Retroelements rely on the RNA polymerase of the host for 
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transcription and the host DNA polymerase for second strand DNA synythesis to 
complete replication. 

Using PCR based methods, retroelements were found within nearly every 
5 species of the plant kingdom studied (Flaveli et aL Nuc. Acids Res. 20:3639-3644 
(1992); Voytas et aL, Proc. NatL Acad. Sci. USA 89:7124-7128 (1992)). Despite the 
ubiquitous nature of retroelements, there is great heterogeneity among the element 
within and among species (Flaveli et a!.. Nuc. Acids Res, 20:3639-3644 (1992), Wang 
et ah. Plant MoL Biol., 33:1051-1058 (1997). Pearce et aL, MoL Gen. Genets 
10 250:305-315 (1996)). 

Retroelements are found to be distributed over the entire lengths of 
chromosomes vaAvena saliva (Katsiotis et aL, Genome, 39:410-417 (1996)) but have 
also been found to be less abundant in heterochromatin, nucleolar organizer regions, 

15 centromeres and telomeres (Pearce et aL, Mol Gen, Genet., 250:305-315 (1996); 
Moore et aL, Genomics, 10:469-476 (1991); Aledo et aL, Theor. Appl Genet.. 
90:1094-1100 (1995); Brandeis et aL, Plant MoL Biol, 33:11-21 (1997)). 
Retroelement-like sequence were found in centromeric regions of grass chromosomes 
(Miller et aL, Genetics, 150:1615-1 623 ( 1 998)). Many retroelements were discovered 

20 by their associations with plant genes (Johns et aL, EMBOJ., 4:1093-1102 (1985); 
Grandbastien et aL, Nature, 337:376-380 (1989); Camirand et aL, MoL Gen. Genet., 
224:33-39 (1990)); White et aL. Proc. NatL Acad. Sci. USA, 91:1 1792-1 1796 (1994)); 
Hu et aL, MoL Gen. Genet., 248:471-480 (1995); Bi and Laten, Plant MoL Biol, 
30:1315-1319 (1996), Royo et zl,MoL Gen. Genet., 250:180-188 (1996); Kumekawa 

25 et aL, MoL Gen. Genet., 260:593-602 (1999)). Many more retroelements or 

retroelement fragments have been identified using PCR with degenerate primers 
(Voytas et aL, Proc. NatL Acad. Sci. USA, 89:7124-7128 (1992)); Flaveli et aL, Nuc. 
Acids Res., 20:3639-3644 (1992); Flaveli et aL, MoL Gen. Genet., 231-233 (1992), 
Pearce et aL, MoL Gen. Genet., 250:305-315 (1996); Katsiotis et aL, Genome, 39:410- 

30 417 (1996); Wang et aL, Plant MoL Biol, 33:1051-1058 (1997)). Others have been 
identified through studies for other purposes (Bhattacharyya et aL, Plant MoL Biol, 
34:255-264(1997); Vicient and Martmez-lzquierdo, Gene, 184:257-261 (1997); 
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Mannmen and Schulman, Plant MoL Biol. 22:829-846 (1993)) or by genome 
sequencing projects. 

The Ty?>l gypsy and the Ty Mcopia elements can be found in large numbers and 
5 may contribute up to 50% of the nuclear DNA of the maize genome (SanMiguel et aL, 
Science. 274:765-768 (1996)). A 280 Kb region of the maize genome containing the 
Adhl-F and u22 genes was composed of retroelements, from 10 different families, 
inserted within each other. The copy number of Ty Mcopia elements varies 
considerably. For example, the Tal elements of Arabidopsis fVoytas et al., Genetics, 
10 126:713-721 (1990)) and the Tstl element of Solanum tuberosum (Camirand et al., 

MoL Gen. Genet., 224:33-39 (1990)) have one to only a few copies whereas the maize 
element PREM-2 (Bennetzen, Trends in Microbiology^ 9:347-353 (1996)) and the 
BARE-1 element of Hordeum vulgare (Mannmen and Schulman, Plant MoL BioL, 
22:829-846 (1993)) may be present at 30,000 or more copies. 

15 

The differences in copy number infer differences in expression of 
retroelements. Retroelements are not expressed at high levels as only a few examples 
of activity have been observed. The Bsl and Zeon-1 elements of maize (Johns et al., 
EMBOJ., 4:1093-1102 (1985); Hu et aL MoL Gen. Genet, 248:471-480 (1995)); the 

20 Tos elements of rice (Hirochika et aL, Proc. Natl Acad, ScL USA 93:7783-7788 

(1996)) the Tntl and Ttol elements of tobacco (Grandbastien et al., Nature, 337:376- 
380 (1989); Hirochika, EMBOJ., 12:2521-2528 ( 1993)) and the Tnp2 element of 
Nicotiana plumbaginifolia have shown evidence of activity. Retroelement expression 
is higher in plant tissues under stressful conditions. The Ttol , Tto2 of tobacco and 

25 Tos 17 element of rice were shown to be activated in tissue culture (Hirochika, EMBO 
J., 12:2521-2528 1993, Hirochika et al, Proc. Natl. Acad. 5c/., USA (1996)). The 
promotors of the BARE- 1 element of barley and the Tnt- 1 element of tobacco drove 
expression of reporter genes in protoplasts (Suoniemi et al. Plant MoL BioL* 31:295- 
306(1996): Pouteau et al., EMBO J., 10:1911-1918 (1991)). 

30 

Biotic stresses such as viral, fungal and bacterial infection and abiotic stress 
such as wounding have also been shown to initiate the expression of Tntl and Ttol 
retroelements (Pouteau et aL, Plant J., 5:535-542 (1994); Moreau-Mhiri et aL, Plant 
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9 : 409-419 (1996); Vernhettes et aL Plant Mol Biol, 35:673-679 (1997); Mhin et 
aL, Plant Mol Biol, 33:257-266 (1997); Grandbastien et al, Genetica, 100:241-252 
(1997); Takeda et al.. Plant Mol Biol. 36:365-376 (1998)). The Bsl element of 
maize may have been mobilized prior to insertion in the Adhl gene by infection with 
5 the barie\ stripe mosaic virus (Johns et al., EMBOJ., 1093-1 102 (1985)). Only the 
expression of BARE- 1 has been observed m normal unstressed barley leaves 
(Suoniemi et ah. Plant Mol. Biol, 31:295-306 (1997)). 



Under normal conditions, retroelements are transcriptionally inactive and are 
10 thus transpositionally inactive. Mechanisms within the host must exist to regulate the 
activity of the retroelements to prevent potentially deleterious mutations that could 
occur if retroelement transposition was unchecked. Most retroelements are highly 
methylated (Bennetzen et al.. Genome. 37:565-576 (1994)) and possibly in 
heterochromatic regions and may not be accessible to transcriptional machinery. 
15 Though silenced in most cases and active in stressful situations, it has been suggested 
that retroelement transposition may create mutations that may be of selective 
advantage and provide a means for adaptation (McClmtock, Science, 226:792-801 
(1984)). 

20 b. Cloning and Sequencing of SPRITE-1. 

A zmet2a genomic clone was isolated from a lambda library (Stratagene) 
constructed from Mo 17 genomic DNA. The sequence was obtained from subclones 
or from PCR products by primer walking. Fragments were sequenced using Big Dye 
terminator cycle sequencing on an ABI sequencer (Perkin-Elmer Applied Biosystems) 
25 at the University of Wisconsin Biotechnology Center Sequencing Facility, Madison, 
Wisconsin. 



Expression analysis was conducted on cDNA's prepared using Marathon 
cDNA Amplification Kit (Clontech) according to the manufacturer's protocols from 
30 mRNA isolated from a Mo 17 10 day old seedling, Mo 17 immature tassel, B73 
immature ear. Black Mexican Sweet (BMS) callus. Mo 17 embryo 24 days after 
pollination, W22 pollen, young roots, and immature leaf tissue from zmet2a normal 
and mutant plants. Total RNA was extracted using Trizol (Gibco/BRL) according to 
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manufacturer's protocol. Seedling mRNA was isolated using oligo dT cellulose 
columns (Pharmacia) all other mRNA isolated using the Poly Attract system 
(Promega). 

c. DNA extraction and Southern analysis for genotyping and methylation 
analysis. 

DNA was extracted from immature leaf blades as described m Saghai Maroof 
et al. (Proc Natl Acad. Sci. USA 81:8014-8018 (1984)). The copy number of 
SPRITE- 1 was determined by digesting DNA ( 10[ig) with EcoRl which does not cut 
0 within the element. The digested DNA was electrophoresed through a 0.8% agarose 
0.5X TBE gel. Geis were treated with 0.25N HC1 for 15 minutes, denatured in 0.2N 
NaOH and 0.6 M NaCl for 30 minutes, then neutralized in 0.5 M Tns 1,5 M NaCl for 
30 minutes. DNA was transferred to Immobilon nylon membrane (Millipore) with 
5X SSC. Blots were dried at 80 °C for 1.5 hours. Pre-hybridization was carried out 
5 in 5X SSC, 50 mM Tris pH 8.0, 0.2% SDS, 10 mM EDTA, 2.5X Denhardt's solution, 
and 0.1 mg/ml single stranded sheared herring DNA overnight (8-16 hours) at 65 °C. 
Hybridization conditions were similar to pre-hybridization except for the addition of 
5% dextran sulfate to the hybridization solution. The blot was probed with a PCR 
fragment (25-50 ng) amplified from the 5' end of the element. Probes were P-32 (50 
0 jiCi) labeled using random priming. Following overnight hybridization at 65 °C, 
blots were washed 2X (0.1 5X SSC, 0.1% SDS) for 30-45 minutes at 65 °C. 
Hybridized blots were then exposed to Kodak BioMax film. Southern analysis with 
methylation sensitive restriction enzymes was conducted on B73 and Mo 17 using the 
same protocols as for genotyping except that 5 fig of DNA was digested. Enzymes 
5 included in the study were the differentially methylation sensitive isoschizomers 
Hpall/Mspl and EcoRlVBstNl as well as other methylation sensitive enzymes: Hhal, 
and Pstl. Blots were hybridized with probes representing different portions of the 
element. 

0 d. HPLC analysis. 

HPLC was conducted according to a modified protocol of Gehrke et al. (J. 
Chromato., 301:199-219 (1984)). B73 x Mol7 recombinant inbred lines carrying a 
SPRITE- 1 insertion were determined using PCR with the zmet2a primers 1 5F and 8R, 
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and the SPRITE- 1 primer 18R. Preparations for each of four plants with and without 
SPRITE- 1 were analyzed. Twenty-five micrograms of DNA was diluted with water 
to a volume of 50 denatured at 96 °C for 5 minutes and immediately placed on ice. 
One hundred microliters of 30 mM ammonium acetate (pH 5.3), 5 ^1 of 20 mM Zinc 

5 Sulfate and 10 fil Nuclease PI (Img/ml in 30 mM ammonium acetate (pH 5.3) was 
added and incubated at 37 °C for 2 hours. This reaction cleaves 5' mononucleotides 
from single stranded DNA. The pH was adjusted with 20 \il of Tris (pH 8.5) and 
approximately 15 units of Calf Intestinal Alkaline Phosphatase was added and 
incubated at 37 C for an additional 2 hours which converts the nucleotides to 

10 nucleosides. Samples were frozen at -20 °C until HPLC analysis. 

HPLC analysis was conducted at the University of Wisconsin Biotechnology 
Center. Madison, Wisconsin. A volume of 40 jul was injected into a Brownlee Lab 
Spheri-5 RP-8 column. Nucleosides were separated with a flow rate of 0.75 ml/mm 

15 using a gradient program consisting of 30 minutes in buffer A (0.05M Potassium 
Phosphate pH 4.0. 2,5% methanol), 19 minutes in buffer B (0.05M Potassium 
Phosphate pH 4.0, 20% methanol). The column was flushed with 70% methanol for 
13 minutes and then re-equilibrated with buffer A for 23 minutes before the injection 
of the next sample. All samples were analyzed on a Beckman System Gold 

20 chromatograph and nucleosides detected at A260 nm and A280 nm. Nucleoside and 
nucleotide standards (Sigma) were used to determine nucleoside peak positions and to 
create a standard curve to determine nucleoside concentration. The ratio of 5- 
methylcytosine to total cytosine was calculated and statistical analysis conducted 
using SAS. 

25 

e. Expression analysis. 

The expression of SPRITE- 1 was determined by hybridizing a SPRITE- 1 probe to 
a Southern blot of cDNA's prepared from different tissues and tissues at different 
stages of development. Tissues included in this study are embryos 24 days after 
30 pollination, 10 day seedlings, immature ear. immature tassel, immature leaf from 

mutant and nonmutant plants, roots. BMS callus, and mature pollen. Total RNA was 
extracted using Trizol (Gibco/BRL) according to the manufacture's protocol. The 
PolyAttract System (Promega) was used to isolate mRNA's from all tissues except 10 
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day seedlings which was isolated using oligo dT cellulose columns (Pharmacia). 
cDNA was synthesized from the isolated RNA's using Marathon cDNA 
Amplification Kit (Clontech). 

5 f. Results 

SPRITE-1 is similar to retrotransposons of the Tyl/copia group. 

In the process of sequencing the maize methyltransferase gene zmet2a, a 
retroelement inserted within an intron of this gene was discovered and names 
SPRITE-1. This element is positioned in opposite transcriptional orientation relative 

10 to zmet2a. The insertion spans 5220 bp and possesses all the components of a 
retroelement. Sequence data indicates that SPRITE-1 is a Long-Terminal- Repeat 
(hereinafter "LTR") retroelement belonging to the Ty l/copia class of retroelements. 
FIG. ! 6a depicts the general structural components of SPRITE-1. FIG. 16b shows the 
sequence of the terminal structural components, SPRITE-1 has a perfect 109 bp 

15 direct terminal repeats which includes a 3 bp inverted repeat that flanks the internal 
element sequence. These repeats have the TG...CA pattern found in most plant 
retroelements and are also shorter than LTR's of most retroelements. LTR's range in 
size from 1 15 bp to 4560 bp from information compiled by Bennetzen (Trends in 
Microbiology, 9:347-353 (1996)). A 5 bp host site duplication flanks the repeats 

20 externally. Downstream and adjoining the 5 f LTR is a primer binding site (PBS) of 
16 bp that has sequence complementary to the wheat germ cytoplasmic initiator 
methionine tRNA (Ghosh et aL Nuc. Acids. Res.. 10:3241-3247 (1982)). Upstream 
and adjoining the 3' LTR is a polypurine tract of 9 bp. Between the putative 
transcription start site to the predicted translation start site is a 550 bp untranslated 

25 region. SPRITE-1 contains a single open reading frame coding 1485 amino acids 
ending with the stop codon at the 5' end of the polypurine tract. 

Database searches for similar coding sequences using BLAST 
(http://www.ncbi.nlm.nih.gov/gov/BLAST/) show that SPRITE-1 belongs to a 
30 different family of retroelements than any other previously described. The most 
closely related elements based on overall amino acid similarity include an 
Arabidopsis retroelement (AC006528), Retrofit from Oryza longstaminata (U72725), 
and Hopscotch from Zea mays (U12626) all having -35% identity and -50% 
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conservation in ammo acid sequence with SPRITE- 1. It also shares 29% identity and 
45% conservation with the copia element from Drosophila. No elements were found 
to have nucleotide similarity with the LTR of SPRITE- 1 further indicating that this is 
a member of a unique family of Ty l/copia type elements. 

5 

SPRITE- 1 has the component retrovirus-like amino acid motifs that code for 
the proteins necessary for transposition. These motifs are the gag-reiated protein that 
contains a Cys-His box also known as the CCHC zinc-binding domain, a protease, an 
integrase. reverse transcriptase and RNase H. These motifs are ordered as they are in 

L0 Ty 1 and copia. FIG. 17 shows amino acid alignments of these conserved region from 
the similar retroelements previously mentioned. These motifs were similarly 
positioned relative to each other in these retroelements except the CCHC zinc binding 
domain which was more variant m position relative to the protease motif. This motif 
was aligned by hand whereas the alignments of the other motifs were constructed by 

15 CLUSTAL W and processed using BOXSHADE. Alignments indicate that SPRITE- 
1 does possess the component protein coding regions necessary for replication and 
transposition. The coding regions of many retroelements have shown mutations that 
create frameshifts or introduce stop codons thus preventing translation of functional 
proteins and preventing transposition. The coding region of SPRITE- 1 is intact and 

20 therefore has the potential to transpose. 

The number of copies of SPRITE-1 is relatively low but variable. 

A survey of inbred lines developed from several different populations and 
other genetic stocks revealed differences in SPRITE- 1 copy number. DNA was 

25 digested with EcoRl and southern blots hybridized with a probe representing the 5' 
untranslated region of SPRITE- L This element does not have any EcoRl restriction 
sites. SPRITE- 1 is found at a low copy number m most maize iines. Copy number 
varies from 3 as in B73 and Mo 17 to 5 as in B 14 and B79 (FIG. 18). The insertion of 
SPRITE-1 into zmet2a is only found in Mo 17 and not in any other maize inbred line 

30 except A682, a line derived from Mo 17 (FIG. 19). C.I. 187-2, a Mo 17 parental line, 
does not contain SPRITE-1. This indicates that SPRITE-1 has been active recently, 
i.e. after the origin of the maize populations used for inbred development. 
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Expression of SPRITE-1 

Expression was investigated by hybridizing a southern blot of cDNAs, 
sythesized from mRNA from different maize tissues, with a SPRITE-1 probe (FIG. 
20). Expression of SRITE-1 was highest in leaf tissue. Expression was highest m 
5 leaf tissue from plants with a MUTATOR insertion in zmet2a and decreased CpNpG 
methylation. A low level of expression was observed m most tissues, but this may be 
due to transcription of other genes containing SPRITE-1 in a sense orientation. 

SPRITE-1 does not effect zmet2a transcript processing. 

10 During the sequencing of zmet2a cDNA, no fragments or subclones possessed 

SPRITE-1 sequence indicating that it is efficiently spliced from the transcript. 
Aberrant splicing has been observed in genes containing retroelements (Pouteau et ah. 
MoL Gen, Genet, 228:233-239 (1991), Varagona et ah, Plant Cell, 4:811-820 (1992), 
Marillonnet and Wessler, Plant Cell, 9:967-978 (1997), Kapitonov and Jurka, J. MoL 

15 EvoL, 48:248-251 (1999)). Expression of three alleles of the waxy gene of maize was 
low due to retroelement insertions within introns (Varagona et al., Plant Cell, 4:811- 
820 (1992)). Varagona et ah (Plant Cell, 4:81 1-820 (1992)) found that although the 
element was spliced out of the waxy transcript, long-range splice site recognition was 
disrupted as exons upstream and downstream of the insertion site were found to be 

20 excluded in some transcripts. Further analysis of the wxG allele showed tissue 
specific differences in RNA processing with more correctly spliced transcripts in 
pollen than in the endosperm (Marillonnet and Wessler, Plant Cell 9:967-978 
(1997)). 

Alternatively spliced transcripts were searched for by PCR amplification of 
25 fragments spanning several exons both upstream and downstream of the SPRITE-1 
insertion site. Fragments were amplified from Mo 17 seedling and immature embryo 
cDNA and compared to fragments amplified from B73 immature ear cDNA (FIG. 
21). Amplification products were separated on an agarose gel and southern blotted. 
The Southern blot was hybridized to a near full length zmet2a cDN A. No differences 
30 were observed between the B73 and Mol 7 products indicating that only correctly 

spliced fragments were detected. The blot was stripped and probed with retroelement 
sequences. No transcripts were amplified that contained any SPRITE- 1 sequence. In 
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the tissues examined in this example, no aberrant transcripts were detected. Aberrant 
splicing products may be at such a low concentration that they are not detectable. 

SPRITE-1 does not effect zmet2a expression and function. 

5 Since SPRITE-1 is inserted into an mtron of zmet2a, the effect of this insertion 

on zmet2a activity was investigated. HPLC data shows no methylation differences 
among the recombinant inbred lines with or without a SPRITE- 1 insertion in zmet2a. 
Lines with a SPRITE-1 insertion had 18.21% ± 1.78 5-methylcytosine whereas lines 
without the insertion had 18.20% ± 0.24. It is probable that most transcripts are 

10 processed correctly since no changes in methylation are observed in plants with a 
SPRITE-1 insertion. 

Regions of SPRITE-1 are hypermethylated 

Portions of SPRITE- 1 were examined to determine the status of cytosine 
15 methylation. Using methylation sensitive restriction enzymes, sites within 970 bp of 
the untranslated region (hereinafter "UTR") immediately downstream from the 
transcription start site was analyzed. FIG. 22 shows methylation sensitive restriction 
digestion patterns for Mo 1 7 and B73. The isoschizomers Hpall and Mspl recognize 
CCGG sequences and are differentially sensitive to methylation. SPRITE-1 has a 
20 single MspliHpall site. Using the SPRITE-1 sequence from Mol7, the zmet2a 
insertion of SPRITE-1 would generate fragments of 5853 bp and 4625 bp. Other 
SPRITE-1 insertions would generate fragments of variable lengths. Southern blots 
show only very large fragments >20 Kb for both Hpall and Mspl. Mspl does show a 
smaller fragment size than Hpall but is much larger than the expected size for the 
25 zmet2a insertion. This indicates that this site is methylated in most SPRITE-1 copies. 

Another pair of isoschizomers BstNI and EcoRll recognize the sequence 
CC(A/T)GG. BstNl is not sensitive to methylation and EcoRll will not cut when the 
internal cytosine is methylated. BstNl should generate SPRITE-1 -specific fragments 
30 of 6, 54, 135, 252, and 784 bp with the UTR probe. All EcoBJl fragments were 
greater than 20 Kb indicating complete methylation of these sites. Hhal which 
recognizes GCGC sites should generate SPRITE-1 -specific fragments of 2884 and 
257 bp and a zmet2a insertion fragment of 2965 bp. No fragments this small were 
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observed indicating methyiation at these sites. The Pstl site recognized with this 
probe was also methylated. 

EXAMPLE 2 - Cloning and Sequencing of zmet2b 

5 A lambda library (Stratagene) constructed from Mo 17 maize genomic DNA 

library was screened with the zmet2a methyltransferase nucleic sequences shown in 
FIG. 1. This screening resulted in the recovery of seven (7) independent clones. Four 
of these clones corresponded exactly to zmet2a nucleic acid sequence. Another type, 
represented by only one clone, had limited homology in non-significant regions. Two 

10 other clones were very similar to the zmet2a methyltransferase nucleic acid sequence 
but were definitely not identical to the zmet2a methyltransferase nucleic acid 
sequence. These clones defined a second gene, referred to as "zmet2b". Primer 
walking resulted in a partial genomic sequence of zmet2b. Primers specific to zmet2b 
were designed and used to amplify zmet2b cDNA (using Marathon cDNA 

15 Amplification Kit from Clontech according to the manufacturer's protocols). The 
RACE products were isolated and cloned into p-GEMT-Easy (Promega), Sequence 
of the RACE products generated a partial cDNA sequence for the 3' end of the gene 
(see FIG. 23). A partial amino acid sequence encoded by this cDNA sequence is 
shown in FIG. 24. A comparison of a portion of the amino acid sequences for zmet2a 

20 and zmet2b is shown in FIG. 25. 

All references cited herein are hereby incorporated by reference. 

The present invention is illustrated by way of the foregoing description and 
25 examples. The foregoing description is intended as a non-limiting illustration, since 
many variations will become apparent to those skilled in the art in view thereof. It is 
intended that all such variations within the scope and spirit of the appended claims be 
embraced thereby. 

30 Changes can be made to the composition, operation and arrangement of the 

method of the present invention described herein without departing from the concept 
and scope of the invention as defined in the following claims. 
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WHAT IS CLAIMED ISi 

1 . An isolated and purified DNA sequence which encodes a Zea mays zmet2a 
methyltransferase and which hybridizes to the nucleic acid sequence shown in FIG. 1 A under 
stringent conditions. 

2. An isolated and purified zmet2a methyltransferase comprising the amino acid 
sequence shown in FIG. 2A. 

3. An isolated and purified DNA sequence which encodes a Zea mays zmet2b 
methyltransferase and which hybridizes to the nucleic acid sequence shown in FIG. IB under 
stringent conditions. 

4. An isolated and purified zmet2a methyltransferase comprising the amino acid 
sequence shown in FIG. 2B. 

5. A recombinant expression cassette comprising the isolated and purified nucleic 
acid sequence of claims 1 or 3, a promoter sequence and a polyadenylation signal sequence, 
wherein the promoter sequence is operably linked to the nucleic acid sequence and the nucleic 
acid sequence is operably linked to the polyadenylation signal sequence, 

6. The recombinant expression cassette of claim 5 wherein the promoter sequence is 
a constitutive or a tissue specific promoter sequence. 

7. A bacterial cell comprising the recombinant expression cassette of claim 5 . 

8. The bacterial cell of claim 7 wherein the bacterial cell is selected from the group 
consisting of Agrobctcterium tumefaciens zn&Agrobacterlum rhizogenes. 

9. A transgenic plant comprising the recombinant expression cassette of claim 5 . 
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1 0. The transgenic plant of claim 9 wherein the promoter sequence and the 
polyadenylation signal sequence is from Cauliflower Mosaic Virus 35S gene. 

1 1 > The transgenic plant of claim 10 wherein transgenic plant is Zea mays, Qryza 
sativa, Secak cereale, Triticum aestivum, Daucus carota> Brassica oleracea, Cucumis me to, 
Cucumis sativus, Latuca sativa, Solanum tubersoutn, Lycopersicon esculentum, Phaseolus 
vulgaris* and Brassica napus. 

12. Seed comprising the recombinant expression cassette of claim 5. 

13. An isolated and purified DN A sequence which encodes a Zea mays zmet2b 
methyltransferase and which hybridizes to the nucleic acid sequence of FIG, 23 under stringent 
conditions. 

14. A recombinant expression cassette comprising the isolated and purified nucleic 
acid sequence of claim 1 3, a promoter sequence and a polyadenylation signal sequence, wherein 
the promoter sequence is operably linked to the nucleic acid sequence and the nucleic acid 
sequence is operably linked to the polyadenylation signal sequence. 

15. The recombinant expression cassette of claim 14 wherein the promoter sequence 
is a constitutive or a tissue specific promoter sequence. 

16. A bacterial cell comprising the recombinant expression cassette of claim 14. 

17. The bacterial cell of claim 16 wherein the bacterial cell is selected from the group 
consisting of Agrobacterium tumefaciens mdAgrobacterium rhizogems* 

18. A transgenic plant comprising the recombinant expression cassette of claim 1 4. 

19. The transgenic plant of claim 1 8 wherein the promoter sequence and the 

polyadenylation signal sequence is from Cauliflower Mosaic Virus 35S gene. 
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20. The transgenic plant of claim 19 wherein transgenic plant is Zea mays, Oryza 
sativa, Secah cereale* Triticum aestivum, Daucus carota, Brassica oleracea, Cucumis melo> 
Cucumts sativus, Latuca sativa, Salanwn tubersoum, Lycopersicon esculentum, Phaseolus 
vulgaris, and Brassica napus. 

21 . Seed comprising the recombinant expression cassette of claim 14. 
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FIG. 1A 
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GAAGCGCGCC GCCAAGGCCG AGGAGATCCA CCAGAACAAG GAGGAGGAGG 
AGGAGGTCGC GGCGGCG7CC TCGGGCAAGC GCAGCCGCAA GGCGGCA7C7 
TCCGGGAAGA AGCCCAAGTC GCCCCCCAAG CAGGCCAAGC CGGGGAGGAA 
GAAGAAGGGG GA7GCCGAGA TGAAGGAGCC CGTGGAGGAC GACGTGTGCG 
CCGAGGAGCC CGACGAGGAG GAGTTGGCCA TGGGCGAGGA GGAGGCCGAG 
GAGCAGGCCA TGCAGGAGGA GGTGGTTGCG GTCGCGGCGG GGTCACCCGG 
GAAGAAGAGG GTGGGGAGAA GGAACGCCGC CGCCGCCGCT GGCGACCACG 
AGCCGGAGTT CATCGGCAGC CCTGTTGCCG CGGACGAGGC GCGC? GCAAC 
TGGCCCAAGC GCTACGGCCG CAGCACTGCC GCAAAGAAAC CGGATGAGGA 
GGAAGAGCTC AAGGCCAGAT GTCACTACCG GAGCGCTAAG GTGGACAACG 
TCGTCTACTG CCTCGGGGAT GACGTCTATG TCAAGGCTGG AGAAAACGAG 
GCAGATTACA TTGGCCGCAT TACTGAATTT TTTGAGGGGA CTGACCAGTG 
TCACTATTTT ACTTGCCGTT GGTTCTTCCG AGCAGAGGAC ACGGTTATGA 
ATTCTTTGGT GTCCATAAGT GTGGATGGCC ACAAGCATGA CCCT AGACGT 
GTTTTTCTTT CTGAGGAAAA GAACGACAAT GTGCTTGATT GCATTATCTC 
CAAGGTCAAG ATAGTCCATG TTGATCCAAA TATGGATCCA AAAGCCAAGG 
CTCAGCTGAT AGAGAGTTGC GACCTATACT ATGACATGTC TTACTCTGTT 
GC AT ATT CT A CATTTGCTAA TATCTCGTCT GAAAATGGGC AGTCAGGCAG 
TGATACCGCT TCGGGTATTT CTTCTGATGA TGTGGATCTG GAGACGTCAT 
CTAGTATGCC AACGAGGACA GCAACCCTTC TTGATCTGTA TTCTGGCTGT 
GGGGGCATGT CTACTGGTCT TTGCTTGGGT GCAGCTCTTT CTGGCTTGAA 
ACTTGAAACT CGATGGGCTG TTGATTTCAA CAGTTTTGCG TGCCAAAGTT 
TAAAATATAA TCATCCACAG ACTGAGGTGC GAAATGAGAA AGCCGATGAG 
TTTCTTGCCC 7CCTTAAGGA A7GGGCAGT7 CTATGCAAAA AATATGTCCA 
AGATGTGGAT TCAAATTTAG CAAGCTCAGA GGATCAAGCG GATGAAGACA 
GCCCTCTTGA CAAGGACGAA TTTGTTGTAG AGAAGCTTGT CGGGATATGT 
TATGGTGGCA GTGACAGGGA AAATGGCATC TATTTTAAGG TCCAGTGGGA 
AGGATACGGC CCTGAGGAGG ATACATGGGA ACCGATTGAT AACTTGAGTG 
ACTGCCCGCA GAAAATTAGA GAATTTGTAC AAGAAGGG C A CAAAAGAAAG 
ATTCTCCCAC TGCCTGGTGA TGTTGATGTC ATTTGTGGAG GCCCACCATG 
CCAAGGTATC AGTGGGTTTA ATCGGTACAG AAACCGTGAT GAGCCACTCA 
AAGATGAGAA AAACAAACAA ATGGTGACTT TCATGGATAT TGTGGCGTAC 
TTGAAGCCCA AGTATGTTCT CAT GG AAAAT GTGGTGGACA TACTCAAATT 
TGCGGATGGT TACCTAGGAA AATAT G C TT T GAGCTGCCTT GTTGCTATGA 
AGTACCAAGC GCGGCTTGGA ATGATGGTGG CTGGTTGCTA TGGTCTGCCA 
CAGTTCAGGA TGCGTGTGTT CCTCTGGGGT GC7CTTTCT? CCATGGTGCT 
CCCTAAGTAT CC7CTGCCCA CCTATGATGT TGTAGTACG7 GGAGGAGCCC 
C7AATGCC7T T7CGCAATGT ATGGTTGCAT ATGACGAGAC ACAAAAACCA 
TCCC7GAAAA AAGCCTTGCT TCTTGGCGAT GCAATTTCAG AT T 7 AC C AAA 
GGTTCAAAAT CACCAGCCTA ACGATGT G AT GGAGTATGGT GGTTCCCCCA 
AGACCGAATT CCAGCGCTAC ATTCGACTCA GTCGTAAAGA CATGTTGGAT 
TGGTCCTTCG G7GAGGGGGC TGGTCCAGA7 GAAGGCAAGC 7CTTGGATCA 
CCAGCCTTTA CGGC7TAACA ACGATGATTA TGAGCGGGTT CAACAGATTC 
CTGTCAAGAA GGGAGCCAAC TTCCGCGACC TAAAGGGCGT GAGGGTTGGA 
GCAAACAATA 7TGTTGAGTG GGATCCAGAA ATCGAGCGTG TGAAACTT7C 
AT CTGGGAAA CCACTGGTTC CTGACTATGC AATGTCATTC ATCAAGGGCA 
AATCAC7CAA GCCG777GGG CGCCTGTGG7 GGGACG AGAC AG7TCC7ACA 



AGCAAGGGTC C7CAC7A7CC GGGAGAACGC AAGG77ACAG GGG77CCCCG 
ATTACTACCG A7TG77TGGC CCGATCAAGG AGAAGTACA7 7CAAGTCGGG 
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1 AGAGCAGCAG CAGCTACCGC AGCCCCTGCC ATGGCGCCGA GCTCCCCGTC 

51 AGCCGCCGCG CCTACACGCG TCTCTGGGCG GAAGCGCGCC GCCAAGGOCG 

102 AGGAGATCCA CCAGAACAAG GAGGAGGAGG AGGAGGTCGC GGCGGCCTCC 

152 TCCGCCAAGC GCAGCCGCAA GGCGGCATCT TCCGGGAAGA AGCCCAAGTC 

201 GCCCCCCAAG CAGGCCAAGC CGGGGAGGAA GAAGAAGGGG GATGCCGAGA 

251 TGAAGGAGCC CGTGGAGGAC GACGTG2GCG CCGAGGAGCC CGACGAGGAG 

301 GACTTGGCCA TGGGCCAGGA GGAGGCCGAG GAGCAGGCCA TGCAGGAGGA 

351 v GCTGGTTGCG GTCGCGGCGG GGTCACCCGG GAAGAAGAGG GTGGGGAGAA 

401 GGAACGCCGC CGCCGCCGCT GGCGACCACG AGCCGGAG7T CATCGGCAGC 

451 CCTGTTGCCG CGGACGAGGC GCGCAGCAAC TGGCCCAAGC GCTACGGCCG 

501 CAGCACTGCC GGAAAGAAAC CGGA2GAGGA GGAAGAGCTC AAGGCCAGAT 

551 GTCACTACCG GAGCGCTAAG GTGGACAACG TCGTCTACTG CCTCGGGGAT 

602 GACGTCTATG TCAAGGCTGG AGAAAACGAG GCAGATTACA TTGGCCGCAT 

651 TACTGAATTT TWGAGGGGA CTGACCAGTG 7CACTAMTT ACTTGCCGTT 

7 01 GGTTCTTCCG AGCAGAGGAC ACGGTTASTCA ATTCTTTGCTL GTCCA2AAG2 1 

751 GTGGAITGGCC ACAAGCATGA CCCTAGACGT GTTTTTCTTT "CTGAGGAAAA 

802 GAACGACAAT GTGCTTGATT GCATTATCTC CAAGGTCAAG \ATAGTCCATG 

851 TTGATCCAAA TATTGGATCCA AAAGCCAAGG CTCAGCTGAT AGAGAGTTGC 

901 GACCTATACT ATGACATGTC TOACTCTGTT GGATATTCTA CATTTGCTAA 

951 TXTCTCGTCT GAAAATGGGC AGTCAGGCAG TGATACCGCT TCGGGTATCT 

1001 CTTCTGATGA TGTGGATCTG GAGACGTCAT CTAGTATGCC AACGAGGACA 

1051 GCAACCCOTC TTGATCTGTA TTCTGGCTGT GGGGGCATGT CTACTGGTCT 

1101 TTGCTTGGGT GCAGCTCTTT CTGGCTTGAA ACOTGAAACT CGATGGGCTG 

1151 TTGATTTCAA CAGTTTTGCG TGCCAAAGTT TAAAATATAA TCATCCACAG 

1201 ACTGAGGTGC GAAATGAGAA AGCCGATGAG T7TCTOGCCC TCCTTAAGGA 

1251 ATGGGGAGTT CTA3TGCAAAA AATA2PGTCCA AGATGTGGAT TCAAATTCAG 

1301 CAAGCTCAGA GGATCAAGCG GATGAAGACA GCCCTCTTGA CAAGGACGAA 

1351 TTTGTTGTAG AGAAGCTTGT CGGGATATGT TATGGTGGCA GT6ACAGGGA 

1402 AAATGGCATC TATTTTAAGG TCCAGTGGGA AGGATACGGC CCTGAGGAGG 

1451 ATACATGGGA ACCGATTGAT AACTTGAGTG ACTGCCCGCA GAAAATTAGA 

1501 GAATTTGTAC AAGAAGGGCA CAAAAGAAAG ATTCTCCCAC TGCCTGGTGA 

1551 TGTTGATGTC ATTTGTGGAG GCCCACCATG CCAAGGTAXC AGTGGGTTTA 

1601 ATCGGTACAG AAACCCTGAT GAGCCACTCA AAGATGAGAA AAACAAACAA 

1651 ATGGTGACTT TCATGGATAT TGTGGCGTAC TTGAAGCCCA AGTATGTTCT 

1701 CATGGAAAAT GTGOTGGACA TACTCAAATT TGCGGATGGT TACCTAGGAA 

1751 AATATGCTTT GAGCTGCCTT GTTGCTATGA AGTACCAAGC GCGGCT7GGA 

1801 ATGATGGTGG CTGGTTGCTA TGGTCTGCCA CAGTTCAGGA TGCGTGTGTT 

1851 CCTCTGGGCT GCTCTTTCTT CCATGOTGCT CCCTAAGTAT CCTCTGCCCA 

1901 CCTATGATGT TGTAGTACGT GGAGGAGCCC CTAATGCCTT TTCGCAATGT 

1951 ATGGTTGCAT ATGACGAGAC ACAAAAACCA TCCCTGAAAA AAGCCTTGCT 

2001 TCTTGGCGAT GCAATTTCAG ATTTACCAAA GGTTGAAAAT CACCAGCCTA 

2051 ACGATGTGAT GGAGTATGGT GGTTCCCCCA AGACCGAATT CCAGCGCTAC 

2101 ATTCGACTCA GTCGTAAAGA CATGTTGGAX TGOTCCTTCG GTGAGGGGGC 

2151 TGGTCCAGAT GAAGGCAAGC TCTTGGATCA CCAGCCTTTA CGGCTTAACA 

2201 ACGATGATTA TGAGCGGGTT CAACAGATTC CTGTCAAGAA GGGAGCCAAC 

2252 TTCCGCGACC TAAAGGGCGT GAGGGTTGGA GCAAACAATA UTGTTGAGTG 

2302 GGATCCAGAA ATCGAGCCTG TGAAACTCTC ATCTGGGAAA CCACTGGWC 

2352 CTGACTATGC AATGTCATTC ATCAAGGGCA AATCACTCAA GCCGTTTGGG 

2402 CGCCTGTGGT GGGACGAGAC AGTTCCTACA GTTGTAACCA GAGCAGAGCC 

2451 TCACAACCAG GTTATAATTC ATCCGACTCA AGCAAGGGTC CTCACTATCC 

2501 GGGAGAACGC AAGGTTACAG GGCTO?CCCCG ATTACTACCG A5TGTTOGGC 

2551 CCGATCAAGG AGAAGTACAT TCAAGTCGGG AACGCAOTGG CTGTCCCTGT 

2601 TGCCCGGGCA CTGGGQTACT GTCTGG3GCA AGCCTACCTG GGTGAATCTG 

2651 AGGGGAGTGA CCCTCtfGTAC CAGCTGCCTC CAAGTTTCAC CTC2TGTTGGA 

2701 GGACGCACTG CGGGGCAGGC GAGGGCCTCT CCTGTTGGCA CCCCTGCAGG 

2751 GGAGGTAGTT GAGCAGTAAA AGGAT GAGAG ATCTGAGCTG AGCTGG 
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F'G- 2A 912 amino acids 

I MAPSSPSPAA PTRVSGRKRA AKAEEIKQNK EEEEEVAAAS sakrsrkaas 

5: SGKKPKSPPK QAKPGRKKKG DAEMKEPVED DVCAEEPDEE ELAMGEEEAE 

101 EQAMQEEVVA VAAGSPGKKR VGRRNAAAAA GDHEPEFIG5 PVAA DEARS N 

151 WPKRYGRS7A AKKPDEEEEL KARCHYRSAK VDNVVYCL.GD DVYVKAGENE 

201 ADYIGRI TEF FEGTDQCHYF TCRWFFRAED TVINSLVSIS VDGHKHDPRR 

251 VFLSEEKNDN VLDCI1SKVK IVHVDPNMDP KAKAQLIKSC DLYYDMSYSV 

301 AYSTFANISS ENGQSGSDTA SGISSDDVDL ETSSSMPTRT ATLLDLYSGC 

351 GGMSTGLCLG AALSGLKLET RWAVDFNSFA CQSLKYNRPQ TEVRNEKADE 

4 01 FLALLKEWAV LCKKYVQDVD SNL \SSEDQA DEDSPLDKDE FWEKLVGIC 

4 51 YGGSDRENGI YFKVQWEGYG PEE3TWEPID NLSDCPQKIR EFVQEGHKRK 

501 ILPLPGDVDV ICGGPPCQGI SG FN RYRNRD EPLKDEKNKQ MVTFMDIVAY 

551 LKPKYVLMEN WDILKFADG YLGKYALSCL VAMKYQARLG MMVAGCYGLP 

601 QFRMRVFLWG ALSSMVLPKY PLPTYDVVVR GGAPNAFSQC MVAYDETQKP 

651 SLKKALLLGD AISDLPKVQN HQPNDVMEYG GSPKTEFQRY IRLSRKDMLD 

701 WSFGEGAGPD EGKLLDHQPL RLNNDDYERV QQ I PVKKGAN FRDLKGVRVG 

751 ANNIVEWDPE IERVKLSSGK PLVPDYAMSF IKGKSLKPFG RLWWDETVPT 

SOI WTRAEPHNQ VIIHPTQARV LT I RENARLQ GFPDYYRLFG PIKEKYIQVG 

3 51 NAVA V P VARA LGYCLGQAYL GESEGSDPLY QLPPSFTSVG GRTAGQARAS 

901 PVGTPAGEW EC 
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FIG. 2B 



RAAAA7AAPA.MAP £ S. PS PAAPTRVS GRKRAAKAEEIHQtf KEEEEEVAAAS 
3AKa3 RXAAS S GXXr KS Pr KQAKPGRKKKGDAEKKEFV£DPVCAE£PDES 
EIAMGEEEAEEQAMQEEVV^VAAGS P GKKRV&R£NAAAAAGDKEP£FX GS 
FVA^EAR5NWPKRyGKSTAAKK?DEEEELK^CHYR5AKVDNVVYCLGC 
B VYVKAGENEAD Y I G RI T EFFEGTDQCHY FT CRWFFRAEDTVTN S LVS I S 
VDGKKKDPRRVTLSEEKNDNVLDCIISKVKIVHVD^ 
rsLyYDM5y£VAYSTFW:iSSENG0SGSDTA5Gl5SDDVDLETSSSMPTRT 
AT LLBL Y5 GCGG?£5 T GLCLGAALSGLKLETRWAVDFNS FACQS LKVNHPQ 
?EVPLVEXA£EFIALLKFtfA\^^ 

FVA^KLVGICYGGSDREMGIYFKVQWEGYGPEEBTWEPICMLSDCPQKIR 
EFVQE GHKRKI LPLPGDVDVI CGGP P CQGI S GFNRYRNRDEPLKDEKNKQ 
MVTFK3I VAYLKP KYVLMENWDILKFADGYLGXYALS CLVAHKYQARL G 
MMVAGCYGLPQFRMRVFLWGALSSMVl^ 

MVAYDSTQKP SLKKALLLGDAI SDLPKVQNHQPNDVMEYGGS PKTEFQRY 
IRLSRKDJ4LDWS FGSGAGPUEGKLLDHQPLRLNIf DDYERVQQI PVXKGAN 
FRDLKGVRVGANN IVEWDP E I ERV7CLS SGKPLVPDYAMS FIXGKSLKP FG 
RLWWTJ ETVP TW? KAE P HNQVI I HPTCARVLT I REMARLQ G FP 0 YYRLFG 
PIXEKYI QVGNAVAVPVARALGYCLGCAYIGESEGSDPLYQIP P S FT SVG 
GRTA&2ARA5 PVGTPAG2W££* KDDRS ELS W 
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Primer 


Sequence 5' - 3' 


1F 


TGGTTGCTATGGTCTGGCACAb I l OAUj 


1R 


CCAGCTCAGCTCAGA T C i o I OA I lb J I ! 


Sea2FN 


/— v AAA 0~T~ A A T"/"*»T A O A i r * v AAA /"* A /"» 

CGAAAGCTAATCTACACAAAGAGC 


Sea2RN 


GATCCTCTGAGCTTGCTAAAr i TG 


3R 


CTCATCTTGGAGTGGCTCATCAG 


S3F 


GAGCACATGAGGGAGAGTGTTG 


S3R 


TCTCTAATTTTCTGCGGGCAG 


4F 


CCTCTGCCCACCTATGATGTTGTA 


5F 


TAAAGGGCGTGAGGGTTGGA 


7F 


TCACATTTGTCATGGCAGGTTATC 


8eF 


CTGAGGAAAAGAACGACAATGTGC 


8eR 


G C AATC AAG C AC ATTG TC GTTCTTTTC CT C 


9eF 


GAAGAAGAGGGTGGGGAGAAGGAACG 


9eR 


TTCTTTGCGGCAGTGCTGCG 


1 1 iF 


GTATTGAATTGATTCTCAACTAGTGCAC 


1 1tR 


CAGGCTCAACGGCGATG 


12iF 


TATGCTTCATCACATAGACCCAAGTC 


12iR 


GATAGACCTAATGCCAAATGAGATTAAG 


13iF 


GC G ATCTTC AGTCTCC ACCATC 


13iR 


GAAGACGTGCCTCCATGTTTCATC 


14F 


GTTGGTTCTTCCGAGCAGAGG 


14R 


GACTGCCACATATCTTATTAATCGC 


15F 


GCATGTGTCAGCAATTGCTTACATTC 


15R 


CCTCTGCTCGGAAGAACCAAC 


16F 


CTGTTCGGAGATTCATGCATGATG - 


16R 


GGAGAACACjAA 1 I 1 bA 1 1 OAA 1 bb 


17F 


GCACTTCACTCTCCTGGCAAACC 


17R 


CGGTACGCTGCTGCTGCTCTC 


18F 


CCATAGCATCTCACATATCGCAAGG 


18R 


GGAAAGAAGGCAGTTAGTTGTAAATGGG 


MU 


AGAGAAGCCAACGCCAWCGCCTCYATTTCGTC 


RaceRT 


CTACAACATCATAGTTGGGCAGAGG 


AP2 marathon 


ACTCACTATAGGGCTCGAGCGGC 


T7 


TAATACGACTCACTATAGGG 


Sd6 


GATTTAGGTGACACTATAG 


M13F 


GTTTTCCCAGTCACGAC 


M13R 


C AG G AAACAGCTATG AC 
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Figure 5 



Hhal Msel 
tacatcaataaaataaggggcgccaacgcaattgtcccttGttttttctaacttaaagttcaagcggcaatgtcg base pairs 
atgtagttattttattccccgcggttgcgttaacagggaacaaaaaagattgaatttcaagttcgccgttacagc 1 to 75 

Msel Msel 

catctgatgtatgaatatcaatttgtaagtactaaacatagtcttaattccaaaattaattacacaacaaagact base pairs 
gtagactacatacttatagttaaacattcatgatttgtatcagaattaaggttttaattaatgtgttgtttctga 76 to 150 

Msel Hinfl 
aaattgtaagcaaacctttcaagtctaattaattcataattacaaatgttattgtaacatcatgtcaccgaatca base pairs 
tttaacattcgtttggaaagttcagattaattaagtattaatgtttacaataacattgtagtacaatggcttagt 151 to 225 

ScrFI 

EcoRII Msel 
taaactaaccaggttcccatgtgtaattagttttataattatattatatttaatatttgtaactaattgatgtga base pairs 
atttgattggtccaagggtacacattaatcaaaatattaatataatataaattataaacattgattaactacact 226 to 300 
Bst.Nl 

Msel Msel Msel 

cagtactaaaattaagcctcttaagccaaaaaatccacatattttagatttaaaatttgaaaacagacgtatcgg base pairs 
gtcatgattttaattcggagaattcggttttttaggtgtataaaatctaaattttaaacttttgtctgcatagcc 301 to 375 

Haelll 

ctagaagagccctgtcactgtcagctaatcaat tacaagaagtggcccatactagttccatcaccagtccagtag base pairs 
gatcttctcgggacagtgacagtcgattagttaatgttcttcaccgggtatgatcaaggtagtggtcaggtcatc 376 to 4 50 

PvuII Haelll Haelll Hhal Hhal Hpall 

tccaccaccccaccctacagctgggtcatctggcacgggtggaggggccaacggccaaaagcgccgcgcacttcc base pairs 
aggtggtggggtgggatgtcgacccagtagaccgtgcccacctccccggttgccggttttcgcggcgcgtga&gg 451 to 525 

Mspl 

Apal 

Hinfl PstI EcoO109I 

ggcgggcaccctCgcggagtcgcgggtgacagcgaaatttcaaatccataccctcccgctgcagacgggccccac base pairs 
ccgcccgtgggagcgcetcagcgcccactgtcgctttaaagtttaggtatgcgagggcgacgtctgcccggggtg 526 to 600 

Haelll 

Taql 

gccgtcaaaatttggacgctcccgctccctcgatcttttgggtttcgtt ttcccagt tcccaccc tctcctccac base pairs 
cggcagttttaaacctgcgagggcgagggagctagaaaacccaaagcaaaagggtcaagggtgggagagaaggtg 601 to 675 

Sau3AI 

Sau3AI Taql Taql Hinfl 
cctqccctqtttccaqatttgaccgatccccttcaattcqatttctacacccacqatqtccaaactccaqa acac base pairs 
ggacgggacaaaggtctaaactggctaggggaagctaagctaaagatgtgggtgccacaggtctgaggtctcgtg 676 to 750 

Hinfl 

ScrFI 
17F EcoRII 

tcactctcrtQQcaaacfi ccttr.gor.cttgccaaccctaQaoaQcaacaQcaactaccacaQf:c:cgt:QgcatoQC base pairs 
aotQaaaagaccatttaacQaaaQcaQaaaaQttaQaatct ctcQtcqtcatCQatoac atcaqQQacaatacgq 751 to 825 
BstNI 17R 
Hhal Sad Hhal Hhal Haelll Sau3AI 

gccgagctccccgtcacccgccgcgcctacacgcgtctctgggcggaagcgcgccgccaaggccgaggagatcca base pairs 
cggctcgaggggcagtgggcggcgcggatgtgcgcagagacccgccttcgcgcggcggttccggctcctctaggt 826 to 900 

ScrFI 

Hhal Hpall 
ccagaacaaggaggaggaggaggaggtcgcggcggcgtcctccgccaagcgcagccgcaaggcggcatcttccgg base pairs 
ggtcttgttcctcctcctcctcctccagcgccgccgcaggaggcggttcgcgtcggcgttccgccgtagaaggcc 901 to 97 5 

Mspl 

Mspl 
Haelll ScrFI 

gaagaagcccaagtcgccccccaagcaggccaagccggggaggaagaagaagggggatgccgagatgaaggagcc base pairs 
cttcttcgggttcagcggggggttcgtccggttcggcccctccttcttcttccccctacggctctiacttcctcgg 976 to 1050 

Hpall 
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fig. 5 

Khal Continued Haelll Haelll 

cgtggaggacgacgtgtgcgccgaggagcccgacyaggaggagttggccatgggcgaggaggaggccgaggagca base pairs 
gcacctcctgctgcacacgcggctcctcgggctgctcctcctcaaccggtacccgctcctcctccggctcctcgt 1051 to 1125 



Mspl 

Haelll Hpall 9eF 

agrraf.graaqaaaaqQtqattacQatCQcqocoqQatcaccCQa qaaQaaQaoaqtqaaaaqaaagaaco ccac base pairs 
ccggtacgtcctcctccaccaacgccagcgccgccccagtgggcccttcttctcccacccctcttccttgcggcg 1126 to 1200 

ScrPI 
Smal 

Hpall Hhal Haelll 

cgccgccgctggcgaccacgagccggagttcatcggcagccctgttgccgcggacgaggcgcgcagcaactggcc base pairs 
gcggcggcgaccgctggtgctcggcctcaagtagccgtcgggacaacggcgcctgctccgcgcgtcgttgaccgg 1201 to 1275 

Mspl 

Hhal Haelll 

caaagcgctacggccgcagcacttgccgcaaaagaagtacattattttctcccagctctggttttgatttgacca base pairs 
g^^rrgr■gat:gccg ClcgtcatQaac9qcqtt:ttct:t cat:gtaat:aaaaaagaat:cgagaccaaaactaaact:got: 1276 to 1350 
9eR 

Hpall 

gatttttactccatgtctgttagtacttgcgagctgagcaatctgctatttgctgatttattgtgcgtgcagacc base pairs 
ctaaaaatgaggtacagacaatcatgaacgctcgactcgttagacgacaaacgactaaataacacgcacgtctgg 13 51 to 1425 

Mspl 

Sac I Haelll Hpall Hhal 

ggatgaggaggaagagctcaaggccagatgtcactaccggagcgctaaggtggacaacgtcgtctactgcctcgg base pairs 
cctactcctccttctcgagttccggtctacagtgatggcctcgcgattccacctgttgcagcagatgacggagcc 1426 to 1500 

Mspl 

ECOO109I 

ggatgacgtctatgtcaaggtccttgttcatcgctttctgttgcttctgctctcatttatgatgtgcatatgtgt base pairs 
cctactgcagatacagttccaggaacaagtagcgaaagacaacgaagacgagagtaaatactacacgtatacaca 1501 to 1575 
Avail 

Msel Kinfl Hpall 

ttgttaaggaagcaagaattgcttgatttttgttgccgactcgcatttccgtgacgagttctgcgtatggtcacc base pairs 
aacaattccttcgttcttaacgaactaaaaacaacggctgagcgtaaaggcactgctcaagacgcataccagtgg 1576 to 16 50 

Mspl 

ScrFI 

TagI BstNI Sau3AI 

ggtacgtggcactgatacacaacgtggtatgctggaagtctggtagtatattttgcatcgaccaggaggtccaga base pairs 
ccatgcaccgtgactatgtgttgcaccatacgaccttcagaccatcatataaaacgtagctggtcctccaggtct 1651 to 1725 

EcoRII Avail 



clal l6iF Hiafl 

r.ggatatatQegqtataatQCttatttqattqcacc ctQttcaoaQattcatQcatiaa tqgcqtiatttaaacaac base pairs 
agctatacacgccatatcacgaataaactaacgtgggacaagcctctaagtacgtactaccgcacaaatctactg 1726 to 18 00 

TaqI 

ScrFI BstNI 
PvuII EcoRII Pvull Haelll Hpall Hhal Hinfl 

gcctcccagacagctgcctgccaggcagctgattctggcccaggcgtccggaatggtgaagttgcgctggcaaga base pairs 
cggagggtctgtcgacggacggtccgtcgactaagaccgggtccgcaggccttaccacttcaacgcgaccgttct 1301 to 1875 
BstNI Kinfl EcoRII Mspl 
ScrFI 

ScrFI 

Haelll EcoRII 
ttctcaggccacctaccaaatatgccctggagcatattgcatgcttctttttttgttctctttccttctatattt base pairs 
aagagtccggtggatggtttatacgggacctcgtataacgtacgaagaaaaaaacaagagaaaggaagatataaa 1876 to 1950 

Bstfitt 



atctcattgttagtgaagtttcacattgcacgtgtcatggaatatttactttcaaatcaacgaggagatgctagc base pairs 
tagagtaacaatcacttcaaagtgtaacgtgcacagtaccttataaatgaaagtttagttgctcctctacgatcg 1951 to 2025 
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ecorv Continued 

attgaggtgtgtgataattattacatactagaagatatcgtgcatgttgccattgggatcgcgaagaatgtggaa base pairs 
taactccacacactattaataatgtacgatcttctacagcacgtacaacggtaaccctaacgcttcttacacctt 2026 Co 2100 

Msel Msel 

atgatgttggtagcttgtattaagaggttaacagttagtgggatgacatgactattagttagagatgatgtggat base pairs 
tactacaaccatcgaacataattctccaattgtcaatcaccctactgtactgataatcaacctctactacaccta 2101 to 2175 

agtaagtgggatatgatgtagatgacttgtgtgttgagacagaactataacatggagttggaaatgggagcagca base pairs 
tcattcaccctatactacatctactgaacacacaaccctgtcttgatattgtacctcaacctttaccctcgtcgt 2176 to 2250 

Msel 

tggtcaaacataccctaaatgcctgtctctacacaatgtggtgattggtgtatagtctggtgttaaaagctggat base pairs 
accagtttgtatgggatttacggacagagatgtgttacaccactaaccacatatcagaccacaattttcgaccta 2251 to 2325 

1 ScrFI 

Hinfl Msel Xbal EcoRll 

actttgattctgttgaagattgtcacacccgaatttaaggacaaatctagatacatctcatatgtgcaccaggat base pairs 
tgaaactaagacaacttctaacagtgtgggcttaaattcctgtttagatctatgtagagtatacacgtggtccta 2326 to 2400 

BstNI 

agtgtatagataccaatgtcataatctttattacacgacgataatgtcttacaaaatatctggtgttacaagatg base pairs 
tcacatatctatggttacagtattagaaataatgtgctgctattacagaatgttttatagaccacaatgttctac 2401 to 2475 

Msel Msel 
cacctttcaacatgtttaatgctgcaaactgttttaattaaacagaatgcagtgttttgaacaaaaaaatgctgc base pairs 
gtggaaagttgtacaaattacgacgtttgacaaaattaatttgtcttacgtcacaaaacttgtttttttacgacg 2476 to 2550 

Msel 

15iF Sau3AI Hinfl Msel 

trtatgctgcatcttatttt Qcatatatcaocaattqcttacatt ccattatQatctctaaaattctttaaattt base pairs 
aaataggacgtagaacaaaacgtacacagtcgttaacgaatgtaaggtaatactagagactctaagaaatttaaa 2551 to 2625 

ctagcatgatgaaagtatttactaattcaactgaacacaaacattgtttgaatgaacaaggcaacacggatgctt base pairs 
gatcgtactactttcataaatgattaagttgacttgtgtttgtaacaaacttacttgttccgttgtgcctacgaa 2626 to 2700 

Msel 

ggaataatggttgtgtataatatcacttagtggttttgctctcacaccacatctttcatgggttctttaataata base pairs 
ccttattaccaacacatattatagtgaatcaccaaaacgagagtgtggtgtagaaagtacccaagaaattattat 2701 to 2775 

Msel Haelll 
gttactgactttaagtttcttattcctttttgtctatcttagctggagaaaacgaggcagattacattggccgc base pairs 
caatgactgaaattcaaagaataaggaaaaacagatagaatcgacctcttttgctccgtctaatgtaaccggcg 2776 to 28S0 

14eF 

attactaaattttttqaqaqaactQaccaatatcactattt tact tacc crtt got tcttcgaag<paqagg acacq base pairs 

taatgagttaaaaaactcccctqactggtcacagtqataaaatqaacqq caaccaaqaaqgctcQtctcc tgtqc 2851 to 292S 

15eR 

gtgtgtatttagtattttgtcattctatgcatgtgtggatttttctggaatgtggaaaacatacagcactctctc base pairs 

cacacataaatcataaaacagtaagatacgtacacacctaaaaagaccttacacctt ttgtatgtcgtgagagag 2926 to 3000 

Msel Haelll Haelll 
tacaccacacacacttctagtatatgtgtacacgttaatgggccaacactagacacatggcccaacatccccct base pairs 
atgtggtgtgtgtgaagatcatatacacatgtgcaattacccggttgtgatctgtgtaccgggttgtaggggga 3001 to 3075 

EcoRV 

caagatgggcgatagatatcaatcatccccatcttgctacataacacatcacactcttttactcctataccctta base pairs 
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Continued „ ,__ n 

gttctacccgctatctatagttagtaggggtagaacgacgtattgtgtagcgtgagaaaatgaggatatgggaat 3076 to 3150 

Hinf I 

qtcaagcaatctgctatttgaccttttgagtttacacgattcaactctaaagtaccattacctaacttctctttg base pairs 

| agct ^ gttagacgataa actggaaaactcaaatgtactaagttgagatt:tcatggtaa£agattgaagagaaac 3151 to 3225 

Clal 
Hinf I 

atgaagaatcgatcaatttccacatgttttgttctatcatgttgaactggattgttagctatattcatggctgac base pairs 
tacttcttagctagttaaaggtgtacaaaacaagatagtacaacttgacctaacaatcgatataagtaccgactg 3226 to 3300 

TaqI 
Sau3AI 

Hinf I Msel 18eF 

, tattatcacaccataa cttcaaggagtcttttcttaatacattcaactctgataagagaccctttatciaia3£ base pairs 
lataatagtgtggtattgaagtccctcagaaaagaattatgtaagttgagactattctctgggaaataggtatcg 3301 to 3375 

Haelll 

arr , rraf , atarf , aea aqg qccataqctcqqtattctgcttcggcggtggaacgggataccacagattgtttcttg base pairs 
tagagtgtatagcgtccccggtatcgagccataagacgaagccgccaccttgccctatggcgtctaacaaagaac 3376 to 34S0 

cttctccatgatactaaatttcctccaacaaacacacaatatcctgaagttgaccttctatcatcaaggcaacta base pairs 
gaagaggtactatgatttaaaggaggttgtttgtgtgttataggacttcaactggaagatagtagttccgttgat 3451 to 3525 

ScrFI 

Msel EcoRII 
ccccagtctgcatcagagtaaccttccacctttagatgaccatgacctttaaagattattccctttccaggacaa base pairs 
ggggtcagacgtagtctcattggaaggtggaaatctactggtactggaaatttctaataagggaaaggtcctgtt 3526 to 3600 

TaqI 

gtcttcaagtatcgcagtatacgatacactgcatcaagatgtccacttctggggtcatgcatatatcgactcacc base pairs 
cagaagttcatagcgtcatacgctatgtgacgtagttctacaggtgaagaccccagtacgtatatagctgagtgg 3601 to 3675 

EcoRV 

acactgactgcatatgtgatatcaggtcttgtatggcacaagtagatgagccgtccaacaagtctttgatacctt base P air s 
tgtgactgacgtatacactatagtccagaacataccgtgttcatctactcggcaggttgttcagaaactatggaa 3676 to 3750 

Sau3AI Hinfl Hinfl TaqI Haelll 

tctttattcacaggatcaccagat: tcagcacataatttatgattcaagtcgataggtgttgctacaggccgacac base pairs 
agaaataagtgtcctagtggtctaagtcgtgtattaaatactaagttcagctatccacaacgatgtccggctgtg 3751 to 3825 

Sau3AI Sau3AI 
cccaacatacctgtttcatcaagtagatctaaaacatatttcctttgggagagaactattccttttggagatcga base pairs 
ttgtatggacaaagt:a g ttcat ctagattttgtataaaggaaaccctctcttgataaggaaaacctctagct 3826 to 3 900 

Bglll 

Sau3AI Hinfl 
gcaatctcaataccaagaaagtatttgagatgaccaagatctttaacctcaaattccttacttagattcttcttt base pairs 
cgttagagttatagttctttcataaactctactggttctagaaattggagtttaaggaatgaatctaagaagaaa 3901 to 3975 

Bglll Msel 

Sau3AI 

agacatgcaatctcaagatcggcatcacctgtaataataatatcatccacatacacagctagaattgcaattcgt base pairs 
tctgtacgttagagttctagccgtagtggacattattattatagtaggtgtatgtgtcgatcttaacgttaagca 3976 to 4050 

Sau3AI 

cgtccaaagtgttgataaaaaacagtgtgatctccgttgcattgtttatatcccatgctacatattgcacgtcta base pairs 
gcaggtttcacaactatttttcgtcacactagaggcaacgtaacaaatatagggtiacgatgtataacgtgcagat 4051 to 4125 

TaqI 

aatctgtcaaaccatgctcttggggactgcttgagaccatacaatgattttttcaatcgacaaactttcccaatt base pairs 
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Continued _ A _ nft 

ttagacagtttggtacgagaacccctgacgaactctggtatgttactaaaaaagttagctgtttgaaagggttaa 4126 to 4200 

ScrFI 
EcoRII Sau3AI 

qtctcaggctttgacaacccagaagggatctccatatagaccccctcctgcaaatcaccatgtaagaaagcactc base pairs 
cagagtccgaaactgctaggtcctccctagaggtatatctggaggagaacgtttagtggtacattctttcgtaag 4201 to 4275 

BstNI 

Mse i Haelll Sau3AI 

ttaacatctagttgatacaagggccatccaaaatttgcagcacaagagatcaatgtccttacagtactcattttt base pairs 
aattgtagatcaactatgttcccggtaggttttaaacgtcgtgttctctagttacaggaatgccatgagtaaaaa 4276 to 4350 

gccactggtgcaaatgtctcatcataatcaattccatatgtttgactataccctcttgcaaccaatcttgcttta base P*irs 
cggtgaccacgtttacagagtagtattagttaaggtatacaaactgatatgggagaacgttggttagaacgaaat 4351 to 4425 



tatcgttccacccttcctcctgggttttgcttcacagtgaatacccatccacaacuaactgccttctttt^uuu* «a ac 
atagcaagatgggaaggaagacccaaaacgaagtgtcacttat ggqtaa^tqttqaC^q^gqqg^q^agag aaat 4426 to 4 500 

Xbal Msel 

ggtagtttctcaaattcccaagtttgatttttttctagagctttaagctcctccaacattgcctcacgccagtta base pairs 
ccatcaaagagtttaagggttcaaactaaaaaaagatctcgaaattcgaggaggttgtaacggagtgcggtcaat 4501 to 4575 

gaattacattgtgcttctttccaatctcttggaattgctacggaatgcaatgatgcaacaaatgctctatatgat base P*irs 
cttaatgtaacacgaagaaaggttagagaaccttaacgatgccttacgttactacgttgtttacgagatatacta 4576 to 4650 



Hinf I 

rs 

: 725 



ggtgacaaagacgcatatgagacataattgctaatgtcatgctcatatccataccttgttggggggactccagct base pair 
ccactgtttctgcgtatactctgtattaacgattacagtacaagtataggtatggaacaacccccctgaggtcga 4651 to 4 

Hhal 

ttagcacgcgctccttttcgtattgcaatgggcaaatcataagtgtcataatcttcagtttctccatgagacgtc base P*i» 
aatcgtgcgcgaggaaaagcataacgttacccgtttagtattcacagtattagaagtcaaagaggtactctgcag 4726 to 4800 

aaaggtacatttatagcctctaatgtgtttggagagaactgctcagtacttgatgctgaattggtttcaggagcc base P ai *s 
tttccatgtaaatatcggagattacacaaacctctcttgacgagtcatgaactacgacttaaccaaagtcctcgg 4801 to 4875 

tgaggttgcacatgggactttcttcttgtatatacttcgcccttatatcgtaagtcgtctccacaagatttatta base P air s 
actccaacgtgtaccctgaaagaagaacatatatgaagcgggaatatagcattcagcagaggtgttctaaataat 4876 to 4950 

ttctcgtgactaggatgtgtctccaattcacttggcattacttgcatcttttgagaagcaccaatcaccacttcc base pairs 
aagagcactgatcctacacagaggttaagtgaaccgtaatgaacgtagaaaactcttcgtggttagtggtgaagg 4951 to 5025 

Hinf I TaqI 
attttatttggttgttgttccattgaatcaaccattctgttctccccctctcgactagcttcatctgtgctagta base P ai ^ 

.^», 3a3 ^ aa r aa r aa ngr aS rt.t a q frrTar aaaa c:aaQaaqQQQaqaQCtcatcqaaqtaqacacgatcat 5026 to 5100 

16iR 

Hinf I Sau3AI 
gagacagaatcaagaaaaaaatttagatctgtcttctcaccatagaaaggcacagtctctctaaatgtaacatcc base P«« 
ctct . gtctta g ttct:tttttt: aaatctagacagaagagtggtatctttccgtgtcagagagatttacattgtagg 5101 to 5175 



Bglll 

Hinfl PstI 

ts 
at 
EcoRV 



niiii.j. 

atgcttacaaacaaacgtcgttcactaggactccaacacttgtatcccttttgccctgcaggatatccaacaaaa base P a1 ^ 
tacgaatgtttgtttgcagcaagtgatcctgaggttgtgaacatagggaaaacgggacgtcctataggttgtttt 5176 to 5250 

BamHI * Sau3AI 
atgcacttcacagcacgaggatccaacttccccacctgaggtctatgatctctgacaaaacatgtacatccaaaa base pairs 
tacgtgaagtgtcgtgctcctaggttgaaggggtggactccagatactagagactgttttgtacatgtaggtttt 5251 to 5325 
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FIG. 5 
Continued 

Hinfl Hinfl 

attttaggtggaaccacaaacttattctcaccgagaagaatctcacatggagtcttcattgcaagtatttttgaa base pairs 
taaaatccaccttggtgtttgaataagagtggctcttcttagagtgtacctcagaagtaacgttcataaaaactt 5326 to 5400 

Msel 

ggagtgcgattaataagatatgtggcagtcaatacagcttcactccataggaacttcggaacattcattgtaaac base pairs 

cctcacqctaattattctatacaccQtcagttatgtcgaag!igaggtiat:cet:t-gaag^rrgraagi-aa r? r tt g 540i ' Co 5475 
14iR 

Hinfl 

atcagcgaacgagcaacttccaaaatgtgacgattcttcctttcagccacaccattttgtggaggtgtatcagga base pairs 

tagtcgcttgctcgttgaaggttttacactgctaagaaggaaagtcggtgtggtaaaacacctccacatagtcct 5476 to 5550 

Msel 

caggatgtctgatgtaatataccatctcttgacagaaatgcattaaatcccttgtttacatactcggttccattg base pairs 
gtcctacagactacattatatggtaaagaactgtctttacgtaatttagggaacaaatgtatgagccaaggtaac 5551 to 5625 

IliF Hinfl 

tctggtcttaqqattttqacttqaqtattQaattqattctcaactaatacagaaaaai-trtg aaaa ^ a nrr^ aJ r base pa i rs 
agaccagaatcctaaaactgaactcataacttaactaagagttgatcacgtgtttttaaaacttttgtgaagtta 5626 to 5700 

X2iF Taqi 
acttcatctttatacttcatcacataaacccaaQtcat:tcgQagaaaaar3afpgataa a gr aa r. aaa g t ^t : t :c base pairs 
tgaagtagaaatacgaagtagtgtatctgggttcagtaaggctctttttgttagctatttcattgtttcatgaag 5701 to 5775 

Clal 

Msel 

atcccattaatagaagtcacaggacatgtccaaacatcagaatgaactagcacaaaaggagatatactcctgata base pairs 
tagggtaattatcttcagtgtcctgtacaggtttgtagtcttacttgatcgtgttttcctctatatgaggactat 5776 to 5850 

Taqi 

cctcgactaatataagatgtccttgtgtgttttgcaaactcacaggcatcacacaatagcttgcttttatccacc base pairs 
ggagctgattatattctacaggaacacacaaaacgtttgagtgtccgtagtgtgttatcgaacgaaaataggtgg 5851 to 5 925 

Hindi I I 

ccactcattacatcaggaaaagctttgcatatcttatcaaaagaaagatgccctaatctacaatgcaagagcatc base pairs 
ggtgagtaatgtagtccttttcgaaacgtatagaatagttttctttctacgggattagatgttacgttctcgtag 5926 to 6000 

Sau3AI 

actgcaacctccttctcttccattcttgttgccagcatagtgcatattgtaccattagtcccctcatgatccata base pairs 
tgacgttggaggaagagaaggtaagaacaacggtcgtatcacgtataacatggtaatcaggggagtactaggtat 6001 to 6075 

ScrFI 

EcoRII MS ei 
taccacaatccattacgcctggtagctgtcccaagtctcttccctgtttccctctcctgaattaaacaattatct base pairs 
atggtgttaggtaatgcggaccatcgacagggttcagagaagggacaaagggagaggacttaatttgttaataga 6076 to 615 0 
BstNI 

Taqi Sau3AI EcoRV 

cgatcaagaataatacgacaatccaattgatcaaccaaggcacttagtgatatcaagttgacaggaaaggttggc base pairs 

gctagttcttattatgctgttaggttaactagttggttccgtgaatcactatagttcaactgtcctttccaaccg 6151 to 6225 
Sau3Al 

Msel 

acatacaaaactgatgacaacttaatagatggagtgcattgcactgtgccaacacccttgatgggttgtggtgta base pairs 
tgtatgttttgactactgttgaattatctacctcacgtaacgtgacacggttgtgggaactacccaacaccacat 6226 to 6300 

EcoRV 

ccatcagcagtttgtataatttctttacgtgtggggggatatcttatatatgatgtaaattcactggacgtgcct base pairs 
ggtagtcgtcaaacatattaaagaaatgcacacccccctatagaatatatactacatttaagtgacctgcacgga 63 01 to 6375 
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Hinfi Msei Continued 

Qtgacatgctttgatgctcctgagtctaaaatccattttaactgtgtgacctgtgtgggtacaaaagcacgagca base pairs 
cactgtacgaaactacgaggactcagattttaggtaaaattgacacactggacacacccatgttttcgtactcgt 6375 to 6450 

Hinfi Sau3AI 

taattaccttcatcagtgtaggcgaagtggacaaaatcccctgtgtgagactcctgatctttatctccagagatt base pairs 
attaatggaagtagtcacatccgcttcacctgttttaggggacacactctgaggactagaaatagaggtctctaa 6451 to 6525 

tgatttttcttcctcaactttgtttcatcttcgtgttccataaatgtttcaagttcttcttgtgtagttgctgca base pairs 
actaaaaagaaggagttgaaacaaagtagaagcacaaggtatttacaaagttcaagaagaacacatcaacgacgt 6526 to 6600 

Msel 

ttcgcccttgcccaactcctgcctccacgacctctgccgcctctaggagcccctcttcctctcccacgattaact base pairs 
aagcgggaacgggttgaggacggaggtgctggagacggcggagatcctcggggagaaggagagggtgctaattga 6601 to 6675 

ttggaaggcttagaacaattacgtgcaatatgtccaacattaccacaattgtaacattctctagtatctttggtt base pairs 
aaccttccgaatcttgttaatgcacgttatacaggttgtaatggtgttaacattgtaagagatcatagaaaccaa 6676 to 675 0 

ScrFI 
Hinfi Kinfl EcoRII 

ctcatagctgaaaacacaggatgaggcggcgtttgagaactttctctcatcactttgagtcttgactcctcctgg base pairs 
gagtatcgactttitgtgtcctactccgccgcaaacccctgaaagagagtagtgaaactcagaactgaggaggacc 6751 to 6825 

BstNI 

Taql 

gatatggcagctatggcttcttgtaggctaggaagagtggattgatgaaacatggaggcacgtcttccctcgaac base pairs 
ctatar!car.caataccqaagaacatccoatccttctcacctaa ct.actt.r.atac:ctccqtQcaqaaq aqaqcttq 6826 to 6 900 

13iR 

tctgagt-ttagcccccttagcaattgaagtacacgtcttttttccacccatttcttcgcccaagcaacacactct base pairs 
agactcaaatcgggggaatcgttaacttcatgtgcagaaaaaaggtgggtaaagaagcgggttcgttgtgtgaga 6901 to 6975 

Sau3AI Sau3AI 

gagtgtggtagctcaataggatcataatgatcaacatcagcccataaacattgtaactcctgaacgtactccgcc base pairs 
ctcacaccatcgagttatcctagtattactagttgtagtcgggtatttgtaacattgaggacttgcatgaggcgg 6976 to 7050 

Sau3AI Sau3AI 13 iF 

acagatcQctccccctatttaatattataaaq qcQatcttcagtctccacca^ aacataacatttccaacticcc base pairs 
tgtctagcgagggggacaaactataatacctccgctagaagtcagaggtggtagttgtattgtaaaggtcgaggg 7051 to 7125 

PstI Hinfi 
gagtacatttcttcaagtgctttccacatttctgcagcacttatgattgtatcaacagtgctagcaattgctgga base pairs 
ctcatgtaaagaagttcacgaaaggtgtaaagacgtcgtgaatactaacatagttgtcacgatcgttaacgacct 7126 to 7200 

Msel 

atcatagaactcaacatccacgctgccactaaagagtttatagcatcccagtctttccattcatcacttaactta base pairs 
tagtatcttgagttgtaggtgcgacggtgatttctcaaatatcgtagggtcagaaaggtaagtagtgaattgaat 7201 to 7275 

Hinfi 

Msel Xhol Xbal 

tccttgggctcaacgacatcccctttaacatagccctcgagtctctttgccttcaataatcgcaatgctcttcta base pairs 
agqaacccqaqttqctqEaqaqqaaattataticqqGaqctcaQaQaaacaqaaattatta acottacaaqaaqat 7276 to 73 5 0 

Taql S41R 

Msel 

gaccatgccaaataatttttcaccccttctaacttaatctcatttggcattaggtctatcttctgaactggctct base pairs 
ctqqta cqqtttattaaaaaqtiqaqqaaaatt qaattaqaqtaaaccataatccaqataG aaqacEtqaccaaqa 7351 to 742 5 

12iR 

Msel Taql 
atatgagcaacattgtctttaattgatgatggagcctcatccctttttgctgacagtaattcgaccaatttacca base pairs 
tatactcgttgtaacagaaattaactactacctcggagtagggaaaaacgactgtcattaagctggttaaatggt 7426 to 75 00 
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FIG. 5 

Continued pvuii 

agaacctgctcaattccttgattttcccccataatatatatgtaataataactaggaaactacttgggcagctgc base pairs 
tcttggacgagttaaggaactaaaagggggtattatatatacattattattgatcctttgatgaacccgtcgacg 7501 to 7575 

EcoRII 
ScrFI 

Sau3AI Xbal EcoRII BstNI Hhal Sau3AI 

accaagatctgggccacaacgtctagaagccaggaccaggagcgcctcctcttcctcctcctccgagctggatgg base pairs 
- agttctagacccagtgttgca gatcttcggtcctggccctcgcggaggagaaggaggaggaggctcgacccacc 7576 to 7650 

BglH BstNI ScrFI 

Avail 

Hpall 

atctcagtcacaggacgcgggcagcaggggggagcagcagcacctgtgtgccggcagcctcctcaagggttggac base P^irs 
tagagtcagtgtcctgcgcccgtcgtccccccccgtcgtcgtggacacacggccgtcgaaggagttcccaacctg 7651 to 7725 

Mspl 

PvuII Sau3AI 
gagctgcggcagctggagagcctcccaagcacccctatctccagatccctgtcgccgacgagtgcccgcgtccac base pairs 
ctC3ncgccgt ^^^^^^^ ag ^ a i-r^ g i-gggq a raqaQatctaqgaacaqcqqctqctcacqqqcgcaggtg 7726 to 7800 

S6RN 

ScrFI 

HaeIII ECOO109I BstNI 

gtccttggccgccctcgccttgtcggcggtggcgtcctctgtgctgtggctcgggacctgtccctggcctcctgc base pairs 
caggaaccggcgggagcggaacagccgccaccgcaggagacacgacaccgagccctggacagggaccggaggacg 7801 to 78 

Avail EcoRII 

HaeIII 

ScrFI 

Hhal ScoRII Taql 

gcggcctccctgctggcgtcggtgtactcgcccgtcttctgcctggtcacgtccctcgcctccctcgatcgctcg base pairs 
cgccggagggacgaccgcagccacatgagcgggcagaagacggaccagtgcagggagcggagggagctagcgagc 7876 to 7950 
HaeIII BstNI Sau3AI 



HaeIII HaeIII Sau3AI Taql HaeIII 

tgtgcctcggcggcctccttcggccgtcgctgatctccttctcggtggtcttctccgtcgaggccgaagacactc base pairs 
acacggagccgccggaggaagccggcagcgactagaggaagagccaccagaagaggcagctccggcttctgtgag 7951 to 8025 



875 



ScrFI 
EcoRII 

gtcaccgcgacgccatcgccgttgagcctggctctgataccatgtggatttttctggaatgtggaaaacatacag base pairs 
r. 3 gr n gr- a rtgr qQr.aacaacaactcggac cqagactatqqtacacctaaaaagaccttacaccttttgtatgtc 8026 to 8100 
lliR BStNl 



Msel HaeIII HaeIII 
cactctctctacaccacacacacttctagtatatgtgtacacgttaatgggccaacactagacacatggcccaac base pairs^ 
gtgagagagatgtggtgtgtgtgaagatcatatacacatgtgcaattacccggttgtgatctgtgtaccgggttg ^ 



8101 to 8175 



7F 

^g^^hg^^n^grggrataqra^ tgacattta r^taacaqattatcaattctttqqtqtccataaqtgtggatgg base pairs 
tcgtacagttcaccgtatcgtgagtgtaaacagtaccgtccaatagttaagaaaccacaggtattcacacctacc 8176 to 8250 

HaeIII 8eF 

^^ nnr ^g a ^^ aga rr ^ ^g1^^^^^f^f^^r oa oaa5aaoaacQacaatqtqcttqattqcattatctccaa base pairs 
aJtarr ^pr,i- a gg a r^rg^^A^ aaa g a n a qarrrrt:tttgttactqttacacqaactaacqtaatagaggtt 8251 to 8325 

8eR 

Sau3AI PstI 
ggtcaagatagtccatgttgatccaaatgtaagtttgctgcagtttgctgagagctttgtggttttgctatacac base pairs 
ccagttctatcaggtacaactaggtttacattcaaacgacgtcaaacgactctcgaaacaccaaaacqatatgtg 8326 to 8400 

5RN 
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FIG. 5' 

Continued BamHi PvuI1 
ataatgtttctgaccaccattgttttgttgcctacttgccttagatggatccaaaagccaaggctcagctgatag base pairs 

tattacaaagactgatggtaacaaaacaacggatgaacggaatctacctaggttttcggttccgagtcgactatc 8401 to 8475 

Sau3AI 

agagttgcgacctatactatgacatgtcttactctgttgcatattctacatttgctaatatctcgtctggtaatt base pairs 
tctcaacgctggatatgatactgtacagaatgagacaacgtataagatgtaaacgattatagagcagaccattaa 8476 to 8550 

Msel 

ccttctgcatcatcttttttggttgactagctgaatgcagttagctttgccaaagagttaaatacatgagttgtt base pairs 
ggaagacgtagtagaaaaaaccaactgatcgacttacgtcaatcgaaacggtttctcaatttatgtactcaacaa 6551 to 8625 

TaqI Msel Msel 

cctgcactcgaaaagggatgtcaataatgtccacaaactctgaaaatgtatttttagatacttaacttgttaagt base pairs . 
ggacgtgagcttttccctacagttattacaggtgtttgagacttttacataaaaatctatgaattgaacaattca 8626 to 8700 

cagtaaaacctgtcagatacttgggttttgggtacgattaccatccttatgtgagtaaaactcgtcaagggatgt base pairs 
gtcattttggacagtctatgaacccaaaacccatgctaatggtaggaatacactcattttgagcagttccctaca 8701 to 8775 

TaqI Seq2FN 

^iitg!i':7tg t : , -g a< - 1 'a < -g t - at - t - a g afafl " f ' r g rrr g t ' :r.raaaaactaatctacacaaacagcttatgtaal:qta base pairs 
gttactgcacaactaacacataatctataagacaaacaagctttcgattagatgtgtttgtcgaatacattacat 8776 to 8850 

Hindi I I Haelll 
aaacctcaaacaaacttgcctcttcataagcttaggtttataggattagcgtttagtgcatgtaaggcctatttg base pairs 
tttggagtttgtttgaacggagaagtattcgaatccaaatatcctaatcgcaaatcacgtacattccggataaac 8851 to 8 925 

BstNI ScrFI ScrFI 

Kaelll Sad EcoRII TaqI EcoRII 

cttcacggcctccctgccgagctcctggctagacagccatcctggccgtaggtgcccgaaatcgaacacctggga base pairs 
gaagtgccggagggacggctcgaggaccgatctgtcggtaggaccggcatccacgggctttagcttgtggaccct 8 926 to 9000 

EcoRII BstNI BstNI 

ScrFI Haelll 

ScrFI 
EcoRII 

gccacgtttgcactagcaggttttcctgggtgcaaaccaaacacgcctatagtgttcaagtataactgaattggt base pairs 
cggtgcaaacgtgatcgtccaaaaggacccacgtttggtttgtgcggatatcacaagttcatattgacttaacca 9001 to 9075 

BstNI 

Msel 

gctcacctttgtctaatagcttaagtttttggttttcatcggtgcatgcaactccatactcaatagtcaatatga base pairs 
cgagtggaaacagattatcgaattcaaaaaccaaaagtagccacgtacgttgaggtatgagttatcagttatact 9076 to 9150 

BstNI 

Xhol Hinfl 

tatagtgttcaagcatagaactctcgagtttgaatcctggcaggggcaatcaaataaaataattgcagcttaccc base pairs 
atatcacaagttcgtatcttgagagctcaaacttaggaccgtccccgttagtttattttattaacgtcgaatggg 9151 to 9225 

TaqI EcoRII 
ScrFI 

S3iF 

rt-3f.M-rtaggr.tt: aaqgacataaqcrqaoao!:attQ aattataaqtQtottctccatctttctctaacaqatgaa base pairs 
gataaagatgcaaactcgtgtactccctctcacaacttaatattcacacaagaggtagaaagagattgtctactt 9226 to 9300 

Hinfl Msel Msel 

ctggtttgtgcatgtaactcaatatgatatttgagtcaaatgtttactttaaaatcatagttgatgcaatttaat base pairs 
gaccaaacacgtacattgagttatactataaactcagtttacaaatgaaattttagtatcaactacgttaaatta 9301 to 9375 
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FIG. 5 
Continued 

aacatatttttctggtctcgtgtgagggagtgtacgtataactgaattgcacacatttccttacagcctaggttt base pairs 
ttgtataaaaaaaccagagcacactcccccacatgcatattgacttaacgtgtgtaaaggaatatcgaacccaaa 9376 to 94S0 

Sau3AI 

ttgactgcaactgttggtgcatgtagctcaataactaaagtcgatccggacagtccacagtgaacaagtttgaca base pairs 
aactgacgttgacaaccacgtacatcgagttattgatttcaacnagacctgtcagatgtcacttattcaaactgt 9451 to 9525 

cttgtaaaatgtgcatatatttttacaaacgctggcacttttttcctaatagaaaatgggcagtcaggcagtgat base pairs 
gaacattttacacgtacataaaaatgtttgcgaccgtgaaaaaaggattatcttttacccgtcagtccgtcacta 9526 to 9600 

Sau3AI 

accgcttcgggtatttcttctgatgatgtggatctggagacgtcatctagtatgccaacgaggacagcaaccctt base pairs 
tggcgaagcccataaagaagactactacacctagacctctgcagtagatcatacggttgctcctgtcgttgggaa 9601 to 9675 

Sau3AI 

cttgatctgtattctggctgtgggggcatgtctactggtctttgcttgggtgcagctctttctggcttgaaactt base pairs 
gaactagacataagaccgacacccccgtacagatgaccagaaacgaacccacgtcgagaaagaccgaactttgaa 9676 to 9750 

Sau3AI 

gaaactgtaatcttctaactagtcatctgttggatagaatatgttcacgatctcagaacttattctattgttctg base pairs 
ctttgacattagaagattgatcagtagacaacctatcttatacaagtgctagagtcttgaataagataacaagac 97S1 to 9825 

Msel 

gcttgcagcgatgggctgttgatttcaacagttttgcgtgccaaagtttaaaatataatcatccacagactgagg base pairs 
cgaacgtcgctacccgacaactaaagttgtcaaaacgcacggtttcaaattttatattagtaggtgtctgactcc 9826 to 9900 

Hinfl 

tatggatagtaaacttcatcttggattccatctgttctgtcagctactcttacaaagtgtctggatttttggatg base pairs 
atacctatcatttgaagtagaacctaaggtagacaagacagtcgatgagaatgtttcacagacctaaaaacctac 9901 to 9975 

Msel 

taggtgcgaaatgagaaagccgatgagtttcttgccctccttaaggaatgggcagttctatgcaaaaaatatgtc base pairs 
atccacgctttactctttcggctactcaaagaacgggaggaattccttacccgtcaagatacgttttttatacag 9976 to 10050 

Hinfl Sau3AI 
caagatgtggattcaaatttagcaagctcagaggatcaagcggatgaagacagccctcttgacaaggacgaattt base pairs 
Jt .«-rf a ^ a r.rr aa g rrraaat:Mtceoaatctcctaq ttcqcctacttctqtcgggagaaetgttcctgctcaaa 10051 to 10125 
Seq2RK 

Hindi I I MseI 
gttgtagagaagcttgtcgggatatgttatggtggcagtgacagggaaaatggcatctattttaaggtacttcag base pairs 

caacatctcttcgaacagccctatacaataccaccgtcactgtcccttttaccgtagataaaattccatgaagtc 10126 to 10200 

Hinfl Msel 
tgtcatttgttcatttctacttgattccaacaaaaaaatcaattacttaagcctgtcaaacgatggatatttctg base pairs 
acagtaaacaagtaaagatgaactaaggttgtttttttagttaatgaattcggacagtttgctacctataaagac 10201 to 10275 

Pstl Haelll 
tatattttgctgtaacgctagatttctgcaggtccagtgggaaggatacggccctgaggaggatacatgggaacc base pairs 
atataaaacgacattgcgatctaaagacgtccaggtcacccttcctatgccgggactcctcctatgtiacccttgg 10276 to 10350 

Avail 

gattgataacttgaggttagtgtatggtatatcgtctgcttgttgccttgtatacctatttgcatctaatccttg base pairs 
c ta3^ a ^g a *^^ aa ^ a ^ ra ^ araraaga ^^ 10351 to 10425 

SeqlRN 

Hinfl 
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FIG. 5 
Continued 

ttttgcaaaccagtgactgcccgcagaaaattagagaatttgtacaagaagggcacaaaagaaagattctcccac base pairs 
?? ^ a ^ t - < . < - C grr- a rr oaffaaaeocecttt aatetett:aaacatqttCttcccqtqtttCCtttetaagagggtg 10426 to 10S00 
S3eR 

EcoRV 

tgcctgtgagtatttagttcgttgtgattttgctcgctatttgtttagctccccttctctatttggtgatatctg base pairs 
acggacactcataaatcaagcaacactaaaacgagcgataaacaaatcgaggggaaaaaataaaccactatagac 10501 to 10 575 

Haelll Msel 
cctattttattctttcaaagggtgatgttgatgtcatttgtggaggcccaccatgccaaggtatcagtgggttta base pairs 
ggataaaataagaaagtttcccactacaactacagtaaacacctccgggtggtacggttccatagtcacccaaat 10576 to 10650 

3F 

a ^^ f7 g^ ? l 7? qaaar:gataataaqccac tcaaaqatqaqaaaaacaaacaaatggtgactttcatggatattgtgg base pairs 
^ a ^ at . a r^t-rgq rArr a rtcaataa a»tctactctttttgtttqtttaccactqaaaqtacctataacacc 10651 to 10725 
3R 

cgtacttgaagcccaagtatgttctcatggaaaatgtggtggacatactcaaatttgcggatggttacctaggaa base pairs 
gcatgaacttcgggttcatacaagagtaccttttacaccacctgtatgagtttaaacgcctaccaatggatcctt 10726 to 10800 

Hhal IF 
aatatgctttgagctgccttgttgctatgaagtaccaagcgcggcttggaatgatggtggct99ttqct a tqqtc base pairs 
ttatacgaaactcgacggaacaacgatacttcatggttcgcgccgaaccttactaccaccgaccaacgataccag 108 01 to 10875 

Race2A 

tQgcacaqttca aaatqcqtqtq tacctctqcqqr afTf.rrttct^eeafegqtctqttctqtaccttqctqtttta base pairs 
a^ggtgi-^ a gi-^ r *«?g gacag a t aaaaac ^ 10876 to 10950 

Race2B 

ScrFI 

Hinfl EcoRII 
tatgcttcgctagattcatattgcactgttggctgctggctaaccaggtgtacgtgtatttgacaatttaggfe^e base pairs 
atacgaagcgatctaagtataacgtgacaaccgacgaccgattggtccacatgcacataaactgttaaatcea-eg 10951 to 11025 

BstNI 

RacelA 4F 

^^^^m^ or ^r_acccaactatQatqt tot aqtacqtqqaqqaqcccctaatgccttt cggtgagtgcaat base pairs 
agggcirteata gqaQacqqattaatactacaacatc atqcacctcctcqqqqattacqqaaagccactcacgtta 11026 to 11100 
RacelB RaceRT 

cacaaaccactactatgaaatcatgtggaatgtgtaaaatacgctgaccaactgaatttgttgcagcaatgtatg base pairs 
gtgtttggtgatgatactttagtacaccttacacattttatgcgactggttgacttaaacaacgtcgttacatac 11101 to 11175 

gttgcatatgacgagacacaaaaaccatccctgaaaaaagccttgcttcttggcgatgcaatttcagatttacca base pairs 
caacgtatactgctctgtgtttttggtagggacttttttcggaacgaagaaccgctacgttaaagtctaaatggt 11176 to 11250 

Msel PstI 

aaggcaagtgttctgtcaagttcatgcatttctcagtgagcatgctatttaactcttctctgcaggttcaaaatc base pairs 
ttccgttcacaagacagttcaagtacgtaaagagtcactcgtacgataaattgagaagagacgtccaagttttag 11251 to 11325 

EcoRI Hhal TaqI 

accagcctaacgatgtgatggagtatggtggttccccaagaccgaattccagcgctacattcgactcagtcgtaa base pairs 
tggtcggattgctacactacctcataccaccaaggggttctggcttaaggtcgcgatgtaagctgagtcagcatt 1132 6 to 11400 

Hmf I 

Haelll Msel 
aggtaaaaaaccccgtgaactactactggttggccttcactacgaatatgttaggatttaatttcagaagaaccg base pairs 
tccattttttggggcacttgatgatgaccaaccggaagtgatgcttatacaatcctaaattaaagtcttcttggc 11401 to 11475 

PstI Avail 
ccttttttttcttggtgcttcggtactactgcagcaagctcactcttattatcatgtcagacatgttggattggt base pairs 
ggaaaaaaaagaaccacgaagccatgatgacgtcgttcgagtgagaataatagtacagtctgtacaacctaacca 114 76 to 11S50 
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FIG. 5 

Avail Continued Sau3AI Mael 

ccttcggtgaagggggctggtccagatgaaggcaagctcttggatcaccagcctttacggcttaacaacgatgat base pairs 
ggaagccacttcccccgaccaggtctacttccgttcgagaacctagtggtcggaaatgccgaattgttgctacta 11551 to 11625 



Hinfl 

tatgagcgggttcaacagattcctgtcaagaaggttggtggcttggtcgcatttgtgccttcctttgttgttttt base pairs 
atactcgcccaagttgtctaaggacagttcttccaaccaccgaaccagcgtaaacacggaaggaaacaacaaaaa 11626 to 11700 



5F 

tccccttctoaaacaatcatctctctt.tcctatQacaaQQaQccaacttccacaacc taaae>qqcQ&aaogqtta base pairs 
aggggaagactttgttagtagagagaaaggatactgtccctcggttgaaggcgctggatttcccgcactcccaac 11701 to 11775 



BamHI Taql 

ga.gcaaacaatattgttgagtgggatccagaaatcgagcgtgtgaaactttcatctgggaaaccactggtatgtg base pairs 
ctcgtttgttataacaactcaccctaggtctttagctcgcacactttgaaagtagaccctttggtgaccatacac 11776 to 11650 
Sau3AI 



tgctatttccgtgctgttgtttcctataactgtgcaacatttactttcccatattcaaactcataactgacgaga base pairs 
acgataaaggcacgacaacaaaggatattgacacgttgtaaatgaaagggtataagttcgagtattgactgctct 11851 to 11925 



Hinfl 

tgctgcaactactgtaagattcatggctaacccatgacaacattttgcacacatctttgttatctaggttcctga base pairs 
acgacgttgatgacattctaagtaccgattgggtactgttgtaaaacgtgtgtagaaacaatagatccaaggact 11926 to 120Q0 



ctatgcaatgtcattcatcaagggcaaatcactcaagtaagtttcaaaacattttttttgttttttgggggaaaa base pairs 
gatacgttacagtaagtagttcccgtttagtgagttcattcaaagttttgtaaaaaaaacaaaaaaccccctttt 12001 to 12075 

Haelll Hhal 

gtaggttattgtttacttgtgcttacatatgatgttgcaggccgtttgggcgcctgtggtgggacaagacagttc base pairs 
catccaataacaaatgaacacgaatgtatactacaacgtccggcaaacccgcggacaccaccctgttctgtcaag 12076 to 12150 

ScrFI 

EcoHII Hae 12 I 

ctacagttgtaaccagagcagagcctcacaaccaggtcagcttcagaaaggccactccttttcgccaatccctgc base pairs 
gatgtcaacattggtctcgtctcggagtgttggtccagtcgaagtctttccggtgaggaaaagcggttagggacg 12151 to 12225 

BstNI 

Sau3AX 

atctgtatttactattagcgtgtgttcccatatgatcattaccgaacatgttgtccacacaggttataattcatc base pairs 
tagacataaatgataatcgcacacaagggtatactagtaatggcttgtacaacaggtgtgtccaatattaagtag 12226 to 123 00 

ScrFI 

Hinfl ECO0109I Hpall 

cgactcaagcaagggtcctcactatccgggagaacgcaaggttacagggcttccccgattattaccgattgtttg base pairs 
gctgagttcgttcccaggagtgataggccctcttgcgttccaatgtcccgaaggggctaataatggctaacaaac 12301 to 12375 
Avail Mspl 

Haelll Sau3AI 
gcccgatcaaggagaagtaagttcctgttttcaagttgcctgtaccagatctagtcactattgaaagttttcagc base pairs 
cgggctagttcctcttcattcaaggacaaaagttcaacggacatggtctagatcagtgataactttcaaaagtcg 12376 to 12450 
Sau3AI Bglll 



agcaagccattcatcagttagttacagctcttgaaagccttacctctgaacatgtgtgctttctctgatggtgat base pairs 
tcgttcggtaagtagtcaatcaatgtcgagaactttcggaatggagacttgtacacacgaaagagactaccacta 12451 to 12525 

Mspl 

HpaXI EcoRII 
aggtacattcaagtcgggaacgcagtggctgtccctgttgcccgggcactgggctactgtctggggcaagcctac base pairs 
tccatgtaagttcagcccttgcgtcaccgacagggacaacgggcccgtgacccgatgacagaccccgttcggatg 12526 to 12600 

ScrFI 
Smal 
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Hinfi pvuii Continued 

ct . gggtgaatctga ggg g agt:gaccctctgtaccagccgcctccaagtttcacctctgtt^3aggacgcactgcg base pairs 
gacccacttagactcccctcactgggagacacggtcgacggaggttcaaagtggagacaacctcctgcgtgacgc 12601 to 12675 

BStNI 

ECO0109I PstI Sau3AI 

gggcaggcgagggcctcttcctgttggcacccctgcaggggaggtagttgagcagtaaaaggatgacagatctga base pairs 
cccgtccgctcccggagaaggacaaccgtggggacgtcccctccatcaactcgtcat tr f r f C?t;agcg^gtaqaCt 12676 to 12750 

Haelll 1R B <3 11J 

TaqI 

gctgagctgggcaacatccagcggcaggagcatttctggttcggttcgattcgggctcacga base pairs 
rgagtgqacc cattqtiaggtcgccgtcctcgtaaagaccaagccaagctaagcccgagtgct 12751 to 12B12 

Hinf I 
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FIG. 7 
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FIG. 8 
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FIG. 9 





SAM binding 


Cytosine binding 


Motif 


M.Hhai 


zmet2a 


M.Hhal 


zmet2a 


1 


Phe18 


Try347 






fl 


Glu40 


Gln407 






Trp41 


Trp408 






III 


Asp60 


Asp428 






IV 


Pro80 


Pro516 


Phe79 


Pro515 


Gln82 


Gln82 


Cys81 


Cys517 


V 


Leu 100 


Val542 






VI 






Glu119 


Glu559 






Asn120 


Asn560 






Val1 21 


Val561 


VIII 






Arg165 


Arg605 


X 


Asn304 


Asn851 
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FIG. 12 



GENOTYPE 


NUMBER 
OF PLANTS 


TOTAL 
5mCytosine 
(%) 


% WT 
levels 


% 

decrease 


wild type 


3 


34.40 + 0.55 


100 


0.0 


heterozygous zmet2a-mu1 


7 


32.00 + 0.90 


93.0 


7.0 


homozygous zmet2a-mu1 


5 


30.40 + 0.19 


88.4 


11.6 
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FIG. 14 

= zmet2a::Mu1 Mo17 Fl BC1 
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FIG. 16 



5' LTR 

catgcTGTTGGGCCATGTGTCTAGTGTTGGCCCATTAACGTGTACA 
CATATACTAGAAGTGTGTGTGGTGTAGAGAGAGTGCTGTATGTTTT 
CCAC.-.TTCCAGAAAAATCCACATGGTATCAGAGCCAGG 

PBS 

3 ' LTR 

PPT 

GAGGGGGAGTGTTGGGCCATGTGTCTAGTGTTGGCCCATTAACGTG 
TACACATATACTAGAAGTGTGTGTGGTGTAGAGAGAGTGCTGTATG 
TTTT C C AC AT T C C AG AAAAAT C CACAca t g c 
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FIG. 17 



SPRITE-1 - 
hopscotch - 
retrofit 
arabpolprt- 
copia 
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arabpolprt- 
copia 



SPRITE- I ■ 
hopscotch - 
Retrofit 
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copia 



SPRITE-1 - 
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Gag 

C YNCGNVGH I ARNC 
CQVCSRVGHTALNC 
CQVCFKRGHTAADC 
CSNCGRTGHEKKEC 
CHHCGREGHIKKDC 



Protease 

TQVTQLKWI LDSGAS KH 
QNGSNVPWYTDTGATDH 
SYGIDTNWYIDTGATDH 
GKTKLGDIILDSGAS HH 
S VMDNCGFVLDSGAS DH 



Integrase 

QVKILRPDN-GTEYVNKGFNAFLSRNGILHQTSCPDTPPQNGVAERKNRHILE 
KIIAFQSDW-GGE~YEKLNAfiFKTlGIHHQVSCPfiTHQQNGAAERKHRHIVE 
KIIAMQTDWRGGR--YQKLNSFFAQIGLIIMCHVLTLIRQNGSAERKHRHIVE 
TVKMVRSDH-GTE-- -FMCLSSYFRENGIIHQTSCVGTFQQNGRVERKHRHILN 
KVVYLYIDN-GREYLSNEMRQFCVKKGISYHLTVPRTPQLNGVSERMIRTITE 

Reverse Transcriptase 

RYKARLVARGY SQTYGI DYDSTHVPVAfCMSTVRTLI SCAANFGWpL YQLDVKKAFLKGdLQEEVYMEI P PG f 5 9 > AlLAVYVDD III 
ft LKARI.VAKGFKQQYGI DYDDTFS PWKHST IRLVLS LAVSQkWS^HQLDVqNAFLHGiLEE TVYMKqPPG {59) I YILVYVDDII I 
RYKARLVAKGFKQRYGIDYEDTFSPWKAATIKlILSIAVSRG^SLRQI'DVQNArLHGFLEEEVYMQQPPG (59)MFVLVYVDDIIV 

rykarlvvqgnkqvbgedyketfapwrmttvrtll { S 9 J LRVLI YVDDLLI 

RYKARLVARGFTQKX QI DYEETFAPVARI SS Fl ^ 

RNase H 

DADWGSClJDRRSTSGYCVrVGG^LVSWRSKKQSWSRSTAEAEYRA>t?\LAICEMLWIKGLL (25 ) NPVQHdRTKHVEIDrfF 

dadwagcpddrkstggyalflgp^iswkskkqstvsrssteaeykamanatasviwlqsl-l (25) k?i fnartkh i evd fh f 
dadwagsiddrkstggfavflgs-ni*vsws&^ npvfhahtkhievdyhf 

0SDWQSCPLTRRS ISAYVVLLGG-SPISWKTKKQISTVSHSSAEAEYRAMS YALKEI KWLRfCLL { 25 J KPVFHeRTKHIESDCH S 
DSDWAGSElDRKSTTGYLFKMFDFNLICWm'KRQKSVAASSTEAEYMALFXACREALWLKFLL (25) NPSCHKRAKHIDIKYHF 
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FIG. 22 

BstNl EcoRll Mspi Hpali 
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FIG. 23 



GGGAATTCGATTACTCACTATAGCGCTCGAGCGGCCGCCCGGGCAGGTTCGAAAACCATC 
AACCTAACGATGTAATGGAGTATGGTGGTTCCCCCAAGACAGAGTTCCAGCGCTACATTC 
GA C TTG GTCGTAAAGACA TG TTG GA TTG GTCG TTTG G TGA GGAGGCTGGTCCAGA TGAA G 
GCAAGCTCTTGGATCACCAGCCCTTACGGCTTAACAATGATGATTATGAGCGGGTTAAGC 
AAATTCCTGTCAAGAAGGGAGCCAACTTCCGTGACCTAAAGGGTGTCAAGGTTGGAGCAA 
ATAATGTTGTTGAGTGGGATCCAGAAGTCGAACGTGTGTACCTTTCGTCTGGGAAACCAC 
TGGTTCCTGACTATGCGATGTCATTCATCAAGGGCAAATCACTCAAGCCATTCGGGCGCC 
TGTGGTGGGACCAGACGGTTcCTACAGTTGTGACCAGAGCAGAGCCTCATAACCAGGTTAT 
ATTGCATCCGACTCAAGCAAGAGTCTTGACTATCCGGGAGAACGCAAGGTTACAGGGCTT 
CCCCGATTACTACCGATTGTTTGGACCGATCAAGGAGAAGTATATTCAAGTCGGGAACGC 
AGTGGCAGTCCCTGTTGCACGGGCACTGGGCTACTGTCTGGGTCAAGCCTACCTGGGTGA 
ATCTGACGGGAGTCAGCCTCTGTACCAGCTGCCTGCAAGTTTTACCTCTGTGGGGCGAAC 
CGCGGTTCAGGCGAATGCCGCTTCTGTTGGCACTCCTGCGGGGGAGGTAGTCGAGCAGTA 
AAAGGATAGCGGAGCAACCCTGGTTGGTATTTTGATTCGAGCCCATCCAGTAGCATGTTT 
ACCAA TAAA TAA TCA TTG GTCG TGCTGA TTCTTA TGG TTG GA GA TGAA TGTATGTAGGGT 
GTACTCGAGCTCGAGTGCTTGTTGTACTGTAGGTTGAGGTTTCTCATCCATTGGCCTGCC 
TA TTTG TG GA TGA CGTTTCA TTTCA GA TTAGCAA TG TG C TTA TTTAA G G TTTCGTCA TG T 
ACCTGTATTCTACAA TCCA CTATTG TTTCCAAA GA CA G CA TTTG A TCCTTAAAAAAAA C T 
GTAAAAAAAAAAAAACAGTGCCCGAAAAGCCGCAAAAAAAAAAAAAAAAAAAACCTGCCC 
GGGCGGCCGCTCGAGCCCTATAGTGAGTAATCGAATTCCC 
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667 KVQNHQPNDVMEYGGSPKTEFQRYIRLSRKDMLDWSFGEGA GPDEGKLLDHQPLRLNNDD 726 

+ +NHQPNDVMEYGGSPKTEFQRYIRL RKDMLDWSFGE A GPDE GKLLDHQPLRLNNDD 
15 RFENHQPNDVMEYG GSPKTEFQRYIRL GRKDMLD WSFGEEA GPDEGKLLDHQPLRLNNDD 74 

727 YERVQ QIP VKKGA NFRDLKG VR VGA NNI VEWDPEIER VKL S S GKPL VPD YA MSFIKGKSL 786 

YER V+QtPVKKGANFRDLKG V+ VGANN+ VEWDPE+ER V LSSGKPLVPDYAMSFIKGKSL 
75 YERVKQtPVKKGANFRDLKGVKVGANNVVEWDPEVERVYLSSGKPLVPDYAMSFIKGKSL 134 

787 KPFGRL WWDETVP TVVTRAEPHN Q VltHP TQA R VL TiRENARLQGFPD YYRLFGPIKEKY 846 

KPFGRLWWD+TVPTVVTRAEPHNQVt+HPTQARVLTIRENARLQGFPDYYRLFGPlKEKY 
135 KPFGRL WWDQTVPTVVTRAEPHNQVILHPTQARVLTIRENARLQGFPDYYRLFGPIKEKY 194 

847 IQVGNAVAVPVARALGYCLGQAYLGESEGSDPLYQLPPSFTSVGGRTAGQARASPVGTPA 906 

IQVGNA VA VPVARALGYCLGQA YLGES+GS PLYQLP SFTSV GRTA QA A+ VGTPA 
195 I Q VGNA VA VPVA RALG YCL G QA YLGESD GSQPL YQLPA SFTS V-GR TA VQA NAAS VG TP A 253 

907 GEWEQ 912 

GEWEQ 
254 GEWEQ 259 
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DECLARATION AND POWER 
OF ATTORNEY FOR UTILITY OR 
DESIGN PATENT APPLICATION 

(37CFR1.63) 

□ Declaration a Declaration 

Submitted Submitted After Initial 
With Initial Filing (surcharge 
Filing (37 CFR 1.16(a)) 
required 


Attorney JJOCKet in o.. wio^o/ruujzuo 


rirsx reamed inventor, ouawn ivi. ivacppici w ui. 




Amplication Number- 09/ 9 14 > 001 


Filing JJdlc, .rvuguai ^vr, ZA/ui 


Group Art Unit: 


Examiner Name: 



As a below-named inventor, I hereby declare that: 

My residence, post office address, and citizenship are as stated below next to my name. 

I believe I am the original, first, and sole inventor (if only one name is listed) or an original, first and joint inventor 
(if plural names are listed) of the subject matter which is claimed and for which a patent is sought on the invention 
entitled: Class II DNA Methvltransferases of Zea mays, the specification of which: 



□ is attached hereto; or 



h was filed on March 10, 2000 as Application Serial No. PCT/US00/06456 and was amended on 

(if applicable). 

I hereby state that I have reviewed and understand the contents of the above-identified specification, including the 
claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose all information which is material to patentability as defined in 37 CFR. 1.56, 
including for continuation-in-part applications, material information which became available between the filing date 
of the prior application and the national or PCT international filing date of the continuation-in-part application. 

I hereby claim foreign priority benefits under 35 U.S.C. 1 19(a)-(d) or 365(b) of any foreign application(s) for patent 
or inventor's certificate, or 365(a) of any PCT international application which designated at least one country other 
than the United States of America, listed below and have also identified below, by checking the box, any foreign 
application for patent or inventor's certificate, or any PCT international application having a filing date before that of 
the application on which priority is claimed. 



Prior Foreign 
Application Numbers 


Country 


Foreign Filing Date 
(MM/DD/YYY) 


Priority Not 
Claimed 


Certified Copy Attached? 


YES 


NO 








□ 


□ 


□ 








□ 


□ 


□ 








□ 


□ 


o 



□ Additional foreign application numbers are listed on a supplemental priority data sheet attached hereto. 
I hereby claim the benefit of any United States application(s) listed below. 
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Application Number(s) 


Filing Date 


o Additional application numbers are listed 
on a supplemental priority data sheet 
attached hereto. 


60/123,888 
60/169,858 


11 March 1999 
09 December 1999 



The undersigned hereby authorizes the U.S. attorney(s) or agent(s) named herein to accept and follow instructions from 
the assignee, if any, of the undersigned or from as to any action to be taken in the Patent and Trademark Office 
regarding this application without direct communication between the U.S. attorney(s) or agent(s) and the undersigned. 
In the event of a change in the persons from whom instructions may be taken, the U.S. attorney(s) or agent(s) named 
herein will be so notified by the undersigned. 



As a named inventor, I hereby appoint the following registered practitioners) to prosecute this application and to 
transact all business in the Patent and Trademark Office connected therewith: 



Lawrence I Chapa Reg. No. 39,135 

Randall T. Erickson Reg. No. 33,872 

Stephen D. Geimer Reg. No, 28,846 

H. Vincent Harsha Reg. No. 1 8,045 

Allen J. Hoover Reg. No. 24,103 



Martin L. Katz 
Kathleen A. Lyons 
John P. Milnamow 
Lisa V. Mueller 
Paul M. Odell 



Reg. No. 25,011 
Reg. No. 31,852 
Reg. No. 20,635 
Reg. No. 38,978 
Reg. No. 28,332 



Keith V. Rockey 
Thomas I. Ross 
Joel E. Siegel 
Paul M. Vargo 



whose mailing address for this application is: 



ROCKEY, MILNAMOW & KATZ, LTD. 
Two Prudential Plaza - Suite 4700 
180 North Stetson Avenue 
Chicago, Illinois 60601 
Telephone: (312)616-5400 
Facsimile: (312)616-5460 



Reg. No. 24,713 
Reg. No. 29,275 
Reg. No. 25,440 
Reg. No. 29,116 



Customer Number (01942) 
and/or Bar Code LabeL^ 



942) / 



I hereby declare that all statements made herein of my knowledge are true and that all statements made on information 
and belief are believed to be true; and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, under 18 U.S.C. 1001 and that such 
willful false statements may jeopardize the validity of the application or any patent issued thereon. 



1 



Name of Sole or First Inventor; Y ^^hawn M. Kaeppler 



Citizenship: 



US 



Residence: 



5290 County Highway A, Oregon, Wisconsin 53575 U_/^J_ 



Post Office Address (if different): 



same as above 



Signature 



Date: f/^ A/ 



a A petition has been filed for this unsigned inventor 
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Name of Additional Inventor, if any: 


Nathan M. Springer 




Citizenship: 


US 


/ 


Residence: 


918 Washington Street Northfield, MN 55057 flj j{j 


Post Office Address (if different): 


Same as above 




Sisnamre: fUfc^- 


D a«e: M ?A>1 


□ A petition has been filed for this unsigned inventor. 



n 



ame of Additional Inventor, if any: 



Michael G. Muszynski 



Citizenship: 



US 



Residence: 



5505 Shriver Avenue #2, Johnston, IA 5013 1 



Post Office Address (if different)'. 



same as above 



Signature: 



Date: 



□ A petition has been filed for this unsigned inventor. 



Name of Additional Inventor, if any : 


_Cbarles M. Paga_^ 




Citizenship: 


US 


/ 


Residence: 


903 Beacon Street #1, Madison, WI 53715$/ 


Post Office Address (if different): 






Signature: 


Date: 


□ A petition has been filed for this unsigned inventor. 
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Name of Additional Inventor, if any: 



Citizenship: 



Residence: 



Post Office Address (if different): 



Signature: 



Date: 



□ A petition has been filed for this unsigned inventor. 
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DLCLAKA11UIN AINU rUWILK 


Attnmpv Hr^Vpt >Jn * WT94QR7P00S9T T<s 


OF ATTORNEY FOR UTILITY OR 
DFSTGN PATENT APPLICATION 


rirsi iNaineu lnvciuur. oiiawii ivi. ivacppici c/ uj. 




(37 CFR1.63) 




i— 1 J_yCC lal ctll UII 


K Dpolaration 




Submitted 
With Initial 


Submitted After Initial 
Filing (surcharge 


riling j_ydie. /\u^um zu, z-uui 


Filing 


(37 CFR 1.16(a)) 
required 


Crronn Art Unit* 

VJ1UUL/ i- Willi* 




Examiner Name: 



As a below-named inventor, I hereby declare that: 



My residence, post office address, and citizenship are as stated below next to my name. 

I believe I am the original, first, and sole inventor (if only one name is listed) or an original, first and joint inventor 
(if plural names are listed) of the subject matter which is claimed and for which a patent is sought on the invention 
entitled: Class II DNA Methvltransferases of Zea mays, the specification of which: 

□ is attached hereto; or 

a was filed on March 10, 2000 as Application Serial No. PCTYUS00/06456 and was amended on 

(if applicable). 

I hereby state that I have reviewed and understand the contents of the above-identified specification, including the 
claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose all information which is material to patentability as defined in 37 CFR. L56, 
including for continuation-in-part applications, material information which became available between the filing date 
of the prior application and the national or PCT international filing date of the continuation-in-part application. 

I hereby claim foreign priority benefits under 35 U.S.C. 1 19(a)-(d) or 365(b) of any foreign application(s) for patent 
or inventor's certificate, or 365(a) of any PCT international application which designated at least one country other 
than the United States of America, listed below and have also identified below, by checking the box, any foreign 
application for patent or inventor's certificate, or any PCT international application having a filing date before that of 
the application on which priority is claimed. 



Prior Foreign 
Application Numbers 


Country 


Foreign Filing Date 
(MM/DD/YYY) 


Priority Not 
Claimed 


Certified Copy Attached? 


YES 


NO 








o 


□ 


□ 








□ 


□ 


□ 








□ 


□ 


□ 



□ Additional foreign application numbers are listed on a supplemental priority data sheet attached hereto. 



I hereby claim the benefit of any United States application(s) listed below. 
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Application Number(s) 


Filing Date 


□ Additional application numbers are listed 
on a supplemental priority data sheet 
attached hereto. 


60/123,888 
60/169,858 


11 March 1999 
09 December 1999 



The undersigned hereby authorizes the U.S. attorney(s) or agent(s) named herein to accept and follow instructions from 
the assignee, if any, of the undersigned or from as to any action to be taken in the Patent and Trademark Office 
regarding this application without direct communication between the U.S. attoraey(s) or agent(s) and the undersigned. 
In the event of a change in the persons from whom instructions may be taken, the U.S. attorney(s) or agent(s) named 
herein will be so notified by the undersigned. 



As a named inventor, I hereby appoint the following registered practitioners) to prosecute this application and to 
transact all business in the Patent and Trademark Office connected therewith: 



Lawrence J. Chapa Reg. No. 39, 1 35 
Randall T. Erickson Reg. No. 33,872 
Stephen D. Geimer Reg. No. 28,846 
H. Vincent Harsha Reg. No. 1 8,045 
Allen J. Hoover Reg. No. 24,103 



Martin L. Katz 
Kathleen A. Lyons 
John P. Milnamow 
Lisa V. Mueller 
Paul M. Odell 



Reg. No. 25,01 1 
Reg. No. 31,852 
Reg. No. 20,635 
Reg. No. 38,978 
Reg. No. 28,332 



Keith V. Rockey 
Thomas I. Ross 
Joel E. Siegel 
Paul M. Vargo 



whose mailing address for this application is: 



ROCKEY, MILNAMOW & KATZ, LTD. 
Two Prudential Plaza - Suite 4700 
180 North Stetson Avenue 
Chicago, Illinois 60601 
Telephone: (312)616-5400 
Facsimile: (312)616-5460 



Customer Number (01942) 
and/or Bar Code Label: 



Reg. No. 24,713 
Reg. No. 29,275 
Reg. No. 25,440 
Reg. No. 29,116 



I hereby declare that all statements made herein of my knowledge are true and that all statements made on information 
and belief are believed to be true; and farther that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, under 1 8 U.S.C. 1 00 1 and that such 
willful false statements may jeopardize the validity of the application or any patent issued thereon. 



Name of Sole or First Inventor: 


Shawn M. Kaeppler 




Citizenship: 


US 




Residence: 


5290 County Highway A, Oregon, Wisconsin 53575 


Post Office Address (if different): 


same as above 




Signature: 


Date: 


□ A petition has been filed for this unsigned inventor. 
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Name of Additional Inventor, if any: 


Nathan M. Springer 




Citizenship: 


US 




Residence: 


918 Washington Street, Northfield, MN 55057 


Post Office Address (if different): 


Same as above 




Signature: 


Date: 


□ A petition has been filed for this unsigned inventor. 



Name of Additional Inventor, if any: 



Michael G. Muszynski 



Citizenship: 



US 



Residence: ?^ /s/jj 7^j% C 'ui i/g, :pf ^3 JffiS-Shriv^^ 1 



Post Office Address (if different): 



same as above 



Signature: 





Date: ^//^h/df 



D A petitionlias been filed for tMs unsigned inventor 



Name of Additional Inventor, if any: 



Charles M. Papa 



Citizenship: 



US 



Residence: 



903 Beacon Street #1, Madison, WI 53715 



Post Office Address (if different): 



Signature: 



Date: 



□ A petition has been filed for this unsigned inventor. 



Name of Additional Inventor, if any: 



Citizenship: 



Residence: 



Post Office Address (if different): 



Signature: 



Date: 



□ A petition has been filed for this unsigned inventor. 
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DECLARATION AND POWER 
OF ATTORNEY FOR UTILITY OR 
DESIGN PATENT APPLICATION 

(37 CFR 1.63) 

□ Declaration b Declaration 

Submitted Submitted After Initial 
With Initial Filing (surcharge 
Filing (37 CFR 1.16(a)) 
required 


Attorney Docket No.: WlMyo7rUU5zUS 


First Named Inventor: Shawn M. Kaeppler et ah 


COMPLETE IF KNOWN 


A^linofinti XTiTmK^ 09/914,001 

Application Number. 5 


Filing Date: August zu, zUUi 


oroup ait unit. 


Examiner Name: 



As a below-named inventor, I hereby declare that: 



My residence, post office address, and citizenship are as stated below next to my name. 

I believe I am the original, first, and sole inventor (if only one name is listed) or an original, first and joint inventor 
(if plural names are listed) of the subject matter which is claimed and for which a patent is sought on the invention 
entitled: Class II DNA Methvltransferases of Zea mays , the specification of which: 

□ is attached hereto; or 

a was filed on March 10. 2000 as Application Serial No. PCT/US00/06456 and was amended on 

(if applicable). 

I hereby state that I have reviewed and understand the contents of the above-identified specification, including the 
claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose all information which is material to patentability as defined in 37 CFR. 1 .56, 
including for continuation-in-part applications, material information which became available between the filing date 
of the prior application and the national or PCT international filing date of the continuation-in-part application. 

I hereby claim foreign priority benefits under 35 U.S.C. I I9(a)-(d) or 365(b) of any foreign application(s) for patent 
or inventor's certificate, or 365(a) of any PCT international application which designated at least one country other 
than the United States of America, listed below and have also identified below, by checking the box, any foreign 
application for patent or inventor's certificate, or any PCT international application having a filing date before that of 
the application on which priority is claimed. 



Prior Foreign 
Application Numbers 


Country 


Foreign Filing Date 
(MM/DD/YYY) 


Priority Not 
Claimed 


Certified Copy Attached? 


YES 


NO 








□ 


□ 


□ 








□ 


□ 


D 








□ 


□ 


□ 



□ Additional foreign application numbers are listed on a supplemental priority data sheet attached hereto. 



I hereby claim the benefit of any United States application(s) listed below. 
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Application Number(s) 


Filing Date 


□ Additional application numbers are listed 
on a supplemental priority data sheet 
attached hereto. 


60/123,888 
60/169,858 


11 March 1999 
09 December 1999 



The undersigned hereby authorizes the U.S. attorney(s) or agent(s) named herein to accept and follow instructions from 
the assignee, if any, of the undersigned or from as to any action to be taken in the Patent and Trademark Office 
regarding this application without direct communication between the U.S. attorney(s) or agent(s) and the undersigned. 
In the event of a change in the persons from whom instructions may be taken, the U.S. attorney(s) or agent(s) named 
herein will be so notified by the undersigned. 

As a named inventor, I hereby appoint the following registered practitioner(s) to prosecute this application and to 
transact all business in the Patent and Trademark Office connected therewith: 



Lawrence J. Chapa Reg. No. 39,1 35 
Randall T.Erickson Reg. No. 33,872 
Stephen D. Geimer Reg. No. 28,846 
H. Vincent Harsha Reg. No. 1 8,045 
Allen J. Hoover Reg. No. 24, 1 03 



Martin L. Katz 
Kathleen A. Lyons 
John P. Milnamow 
Lisa V. Mueller 
Paul M. Odell 



Reg. No. 25,011 
Reg. No. 31,852 
Reg. No. 20,635 
Reg. No. 38,978 
Reg. No. 28,332 



Keith V. Rockey 
Thomas I. Ross 
Joel E. Siegel 
Paul M. Vargo 



whose mailing address for this application is: 



ROCKEY, MILNAMOW & KATZ, LTD. 
Two Prudential Plaza - Suite 4700 
180 North Stetson Avenue 
Chicago, Illinois 60601 
Telephone: (312)616-5400 
Facsimile: (312)616-5460 



Reg. No. 24,713 
Reg. No. 29,275 
Reg. No. 25,440 
Reg. No. 29,116 



Customer Number (01942) 
and/or Bar Code Label: 



I hereby declare that all statements made herein of my knowledge are true and that all statements made on information 
and belief are believed to be true; and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, under 18 U.S.C. 1001 and that such 
willful false statements may jeopardize the validity of the application or any patent issued thereon. 



Name of Sole or First Inventor: 


Shawn M. Kaeppler 




Citizenship: 


US 




Residence: 


5290 County Highway A, Oregon, Wisconsin 53575 


Post Office Address (if different): 


same as above 




Signature: 


Date: 


□ A petition has been filed for this 


unsigned inventor. 
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Name of Additional Inventor, if any: 


Nathan M. Springer 




Citizenship: 


US 




Residence: 


918 Washington Street, Northfield, MN 55057 


Post Office Address (if different): 


Same as above 




Signature: 


Date: 


□ A petition has been filed for this unsigned inventor. 




Name of Additional Inventor, if any: 


Michael G. Muszynski 




Citizenship: 


US 




Residence: 


5505 Shriver Avenue #2, Johnston, IA 5013 1 


Post Office Address (if different): 


same as above 




Signature: 


Date: 


□ A petition has been filed for this unsigned inventor. 




Name of Additional Inventor, if any: 


Charles M. Papa 




Citizenship: 


US 




Residence: 


903 Beacon Street #1, Madison, WI 53715 


Post Office Address (if different): 


1590 44th Road,Bellwood, NE 68624 


Signature: ^^^^ 




Date: 2eo\ 


□ A petition has been filed for this unsigned inventor. 



Name of Additional Inventor, if any: 



Citizenship: 



Residence: 



Post Office Address (if different): 



Signature: Date: 



□ A petition has been filed for this unsigned inventor. 
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DECLARATION AND POWER 
OF ATTORNEY FOR UTILITY OR 
DESIGN PATENT APPLICATION 

(37CFR1.63) 

□ Declaration B Declaration 

Submitted Submitted After Initial 
With Initial Filing (surcharge 
Filing (37 CFR 1.16(a)) 
required 


Attorney jjocicet ino.. wio'+yo /ruujzuo 


rirst iNameu invenior. onawn ivi. ivdeppicr ei at. 




AnnlWinn NnmW 09/914,001 

Application iNurnDer. 


riling uate. /-vagus i z,wi 


frrmin Art T Init* 

VJlv'UL' ill I V_/" lilt. 


Examiner Name: 



As a below-named inventor, I hereby declare that: 



My residence, post office address, and citizenship are as stated below next to my name, 

I believe I am the original, first, and sole inventor (if only one name is listed) or an original, first and joint inventor 
(if plural names are listed) of the subject matter which is claimed and for which a patent is sought on the invention 
entitled: Class II DNA Methvltransferases of Zea mays, the specification of which: 

□ is attached hereto; or 

is was filed on March 10, 2000 as Application Serial No. PCT/US00/06456 and was amended on 

(if applicable). 

I hereby state that I have reviewed and understand the contents of the above-identified specification, including the 
claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose all information which is material to patentability as defined in 37 CFR. 1.56, 
including for continuation-in-part applications, material information which became available between the filing date 
of the prior application and the national or PCT international filing date of the continuation-in-part application. 

I hereby claim foreign priority benefits under 35 U.S.C. 1 19(a)-(d) or 365(b) of any foreign application(s) for patent 
or inventor's certificate, or 365(a) of any PCT international application which designated at least one country other 
than the United States of America, listed below and have also identified below, by checking the box, any foreign 
application for patent or inventor's certificate, or any PCT international application having a filing date before that of 
the application on which priority is claimed. 



Prior Foreign 
Application Numbers 


Country 


Foreign Filing Date 
(MM/DD/YYY) 


Priority Not 
Claimed 


Certified Copy Attached? 


YES 


NO 








□ 


□ 


□ 








□ 


□ 


o 








□ 


□ 


n 



□ Additional foreign application numbers are listed on a supplemental priority data sheet attached hereto. 



I hereby claim the benefit of any United States application(s) listed below. 
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Application Number(s) 


Filing Date 


60/123,888 
60/169,858 


11 March 1999 
09 December 1999 



Additional application numbers are listed 
on a supplemental priority data sheet 
attached hereto. 



The undersigned hereby authorizes the U.S. attorney(s) or agent(s) named herein to accept and follow instructions from 
the assignee, if any, of the undersigned or from as to any action to be taken in the Patent and Trademark Office 
regarding this application without direct communication between the U.S. attorney(s) or agent(s) and the undersigned. 
In the event of a change in the persons from whom instructions may be taken, the U.S. attorney(s) or agent(s) named 
herein will be so notified by the undersigned. 

As a named inventor, I hereby appoint the following registered practitioner(s) to prosecute this application and to 
transact all business in the Patent and Trademark Office connected therewith: 



Lawrence J. Chapa Reg. No. 39, 1 35 
Randall T. Erickson Reg. No. 33,872 
Stephen D. Geimer Reg. No. 28,846 
H. Vincent Harsha Reg. No. 1 8,045 
Allen J. Hoover Reg. No. 24,103 



Martin L. Katz 
Kathleen A. Lyons 
John P. Milnamow 
Lisa V. Mueller 
Paul M. Odell 



Reg. No. 25,011 
Reg. No. 31,852 
Reg. No. 20,635 
Reg. No. 38,978 
Reg. No. 28,332 



Keith V. Rockey 
Thomas I. Ross 
Joel E. Siegel 
Paul M. Vargo 



Reg. No. 24,713 
Reg. No. 29,275 
Reg. No. 25,440 
Reg. No. 29,116 



whose mailing address for this application is: 



ROCKEY, MILNAMOW & KATZ, LTD. 
Two Prudential Plaza - Suite 4700 
1 80 North Stetson Avenue 
Chicago, Illinois 60601 
Telephone: (312)616-5400 
Facsimile: (312)616-5460 



Customer Number (01942) > 
and/or Bar Code%abelL 



I hereby declare that all statements made herein of my knowledge are true and that all statements made on information 
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SEQUENCE LISTING 

<110> Wisconsin Alumni Research Foundation et al . 
Pioneer Hi -Bred, International Inc. 
Regents of The University of Minnesota 
Kaeppler, Shawn M. 
Springer, Nathan M. 
Muszynski, Michael G. 
Papa, Charles M . 

<120> Nucleic Acid and Amino Acid Sequences Encoding Class II 
DNA Methyl transferases 

<130> WIS4987P0051PCT 

<140> PCT/US00/0S456 
<141> 2000-03-10 

p60> 90 

MpL7 0> Patentln Ver. 2.1 
: pio> i 

%ili> 2736 
%212> DNA 
'<*213> Zea mays 

;^40o> i 

Jjtggcgccga gctccccgrc acccgccgcg cctacacgcg tctctgggcg gaagcgcgcc 60 
igccaaggccg aggagatcca ccagaacaag gaggaggagg aggaggtcgc ggcggcgtcc 12 0 
j^ccgccaagc gcagccgcaa ggcggcatct tccgggaaga agcccaagtc gccccccaag 180 
j^aggccaagc cggggaggaa gaagaagggg gatgccgaga tgaaggagcc cgtggaggac 24 0 
gacgtgtgcg ccgaggagcc cgacgaggag gagttggcca tgggcgagga ggaggccgag 300 
gagcaggcca tgcaggagga ggtggttgcg gtcgcggcgg ggtcacccgg gaagaagagg 360 
gtggggagaa ggaacgccgc cgccgccgct ggcgaccacg agccggagtt catcggcagc 420 
cctgttgccg cggacgaggc gcgcagcaac tggcccaagc gctacggccg cagcactgcc 480 
gcaaagaaac cggatgagga ggaagagctc aaggccagat gtcactaccg gagcgctaag 540 
gtggacaacg tcgtctactg cctcggggat gacgtctatg tcaaggctgg agaaaacgag 600 
gcagattaca ttggccgcat tactgaattt tttgagggga ctgaccagtg tcactatttt 660 
acttgccgtt ggttcttccg agcagaggac acggttatca attctttggt gtccataagt 720 
gtggatggcc acaagcatga ccctagacgc gtttttcttt ctgaggaaaa gaacgacaat 780 
gtgcttgatt gcattatctc caaggtcaag atagtccatg ttgatccaaa tatggatcca 840 
aaagccaagg ctcagctgat agagagttgc gacctatact atgacatgtc ttactctgtt 900 
gcatattcta catttgctaa tatctcgtct gaaaatgggc agtcaggcag tgataccgct 960 
tcgggtattt cttctgatga tgtggatctg gagacgtcat ctagtatgcc aacgaggaca 1020 
gcaacccttc ttgatctgta ttctggcrgt gggggcatgt ctactggtct ttgcttgggt 1080 
gcagctcttt ctggcttgaa acttgaaact cgatgggctg ttgatttcaa cagttttgcg 114 0 
tgccaaagtt taaaatataa tcatccacag actgaggtgc gaaatgagaa agccgatgag 1200 
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tttcttgccc tccttaagga atgggcagtt ctatgcaaaa aatatgtcca agatgtggat 1260 

tcaaatttag caagctcaga ggatcaagcg gatgaagaca gccctcttga caaggacgaa 1320 

tttgttgtag agaagcttgt cgggatatgt tatggtggca gtgacaggga aaatggcatc 1380 

tattttaagg tccagtggga aggatacggc cctgaggagg atacatggga accgattgat 144 0 

aacttgagtg actgcccgca gaaaattaga gaatttgtac aagaagggca caaaagaaag 1500 

attctcccac tgcctggtga tgttgatgtc atttgtggag gcccaccatg ccaaggtatc 1560 

agtgggttta atcggtacag aaaccgtgat gagccactca aagatgagaa aaacaaacaa 1620 

atggtgactt tcatggatat tgtggcgtac ttgaagccca agtatgttct catggaaaat 1680 

gtggtggaca tactcaaatt tgcggatggt tacctaggaa aatatgcttt gagctgcctt 1740 

gttgctatga agtaccaagc gcggcttgga atgatggtgg ctggttgcta tggtctgcca 1800 

cagttcagga tgcgtgtgtt cctctggggt gctctttctt ccatggtgct ccctaagtat 1860 

cctctgccca cctatgatgt tgtagtacgt ggaggagccc ctaatgcctt ttcgcaatgt 1920 

atggttgcat atgacgagac acaaaaacca tccctgaaaa aagccttgct tcttggcgat 1980 

gcaatttcag atttaccaaa ggttcaaaat caccagccta acgatgtgat ggagtatggt 2040 

ggttccccca agaccgaatt ccagcgctac attcgactca gtcgtaaaga catgttggat 2100 

tggtccttcg gtgagggggc tggtccagat gaaggcaagc tcttggatca ccagccttta 2160 

cggcttaaca acgatgatta tgagcgggtt caacagattc ctgtcaagaa gggagccaac 2220 

ttccgcgacc taaagggcgt gagggttgga gcaaacaata ttgttgagtg ggatccagaa 2280 

§J|cgagcgtg tgaaactttc atctgggaaa ccactggttc ctgactatgc aatgtcattc 2340 

^tcaagggca aatcactcaa gccgtttggg cgcctgtggt gggacgagac agttcctaca 24 00 

^ttgtaacca gagcagagcc tcacaaccag gttataattc atccgactca agcaagggtc 2460 

hkcactatcc gggagaacgc aaggttacag ggcttccccg attactaccg attgtttggc 2520 

: <fcgatcaagg agaagtacat tcaagtcggg aacgcagtgg ctgtccctgt tgcccgggca 2580 

jHtgggctact gtctggggca agcctacctg ggtgaatctg aggggagtga ccctctgtac 264 0 

: %agctgcctc caagtttcac ctctgttgga ggacgcactg cggggcaggc gagggcctct 2700 

"cctgttggca cccctgcagg ggaggtagtt gagcag 273 6 

h.b 

l<.210> 2 

r fk2 11 > 27 96 

S$ai2> DNA 

s'i<|213> Zea mays 

<4C0> 2 

agagcagcag cagctaccgc agcccctgcc atggcgccga gctccccgtc acccgccgcg 60 
cctacacgcg tctctgggcg gaagcgcgcc gccaaggccg aggagatcca ccagaacaag 120 
gaggaggagg aggaggtcgc ggcggcgtcc tccgccaagc gcagccgcaa ggcggcatct 180 
tccgggaaga agcccaagtc gccccccaag caggccaagc cggggaggaa gaagaagggg 24 0 
gatgccgaga tgaaggagcc cgtggaggac gacgtgtgcg ccgaggagcc cgacgaggag 3 00 
gagttggcca tgggcgagga ggaggccgag gagcaggcca tgcaggagga ggtggttgcg 360 
gtcgcggcgg ggtcacccgg gaagaagagg gtggggagaa ggaacgccgc cgccgccgct 420 
ggcgaccacg agccggagtt catcggcagc cctgttgccg cggacgaggc gcgcagcaac 480 
tggcccaagc gctacggccg cagcactgcc gcaaagaaac cggatgagga ggaagagctc 54 0 
aaggccagat gtcactaccg gagcgctaag gtggacaacg tcgtctactg cctcggggat 600 
gacgtctatg tcaaggctgg agaaaacgag gcagattaca ttggccgcat tactgaattt 660 
tttgagggga ctgaccagtg tcactatttt acttgccgtt ggttcttccg agcagaggac 720 
acggttatca attctttggt gtccataagt gtggatggcc acaagcatga ccctagacgt 780 
gtttttcttt ctgaggaaaa gaacgacaat gtgcttgatt gcattatctc caaggtcaag 840 
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atagtccatg ttgatccaaa tatggatcca aaagccaagg ctcagctgat agagagttgc 900 
gacctatact atgacatgtc ttactctgtt gcatattcta catttgctaa tatctcgtct 960 
gaaaatgggc agtcaggcag tgataccgct tcgggtattt cttctgatga tgtggatctg 1020 
gagacgtcat ctagtatgcc aacgaggaca gcaacccttc ttgatctgta ttctggctgt 1080 
gggggcatgt ctactggtct ttgcttgggt gcagctcttt ctggcttgaa acttgaaact 1140 
cgatgggctg ttgatttcaa cagttttgcg tgccaaagtt taaaatataa tcatccacag 1200 
actgaggtgc gaaatgagaa agccgatgag tttcttgccc tccttaagga atgggcagtt 1260 
ctatgcaaaa aatatgtcca agatgtggat tcaaatttag caagctcaga ggatcaagcg 1320 
gatgaagaca gccctcttga caaggacgaa tttgttgtag agaagcttgt cgggatatgt 1380 
tatggtggca gtgacaggga aaatggcatc tattttaagg tccagtggga aggatacggc 144 0 
cctgaggagg atacatggga accgattgat aacttgagtg actgcccgca gaaaattaga 1500 
gaatttgtac aagaagggca caaaagaaag attctcccac tgcctggtga tgttgatgtc 1560 
anttgtggag gcccaccatg ccaaggtatc agtgggttta atcggtacag aaaccgtgat 1620 
gagccactca aagatgagaa aaacaaacaa atggtgactt tcatggatat tgtggcgtac 1680 
ttgaagccca agtatgttct catggaaaat grggtggaca tactcaaatt tgcggatggt 1740 
tacctaggaa aatatgcttt gagctgcctt gttgctatga agtaccaagc gcggcttgga 1800 
atgatggtgg ctggttgcta tggtctgcca vshyyvshhs yhvhyhyhyy cctctggggt 1860 
g^fctttctt ccatggtgct ccctaagtat cctctgccca cctatgatgt tgtagtacgt 1920 
Igaggagccc ctaatgcctt ttcgcaatgt atggttgcat atgacgagac acaaaaacca 1980 
tfcctgaaaa aagccttgct tcttggcgat gcaatttcag atttaccaaa ggttcaaaat 2040 
||ccagccta acgatgtgat ggagtatggt ggttccccca agaccgaatt ccagcgctac 2100 
4itcgactca gtcgtaaaga catgttggat tggtccttcg gtgagggggc tgguccagat 2160 
gSfaggcaagc tcttggatca ccagccttta cggcttaaca acgatgatta tgagcgggtt 2220 
flacagattc ctgtcaagaa gggagccaac ttccgcgacc taaagggcgt gagggttgga 2280 
fjaaacaata ttgttgagtg ggatccagaa atcgagcgtg tgaaactttc atctgggaaa 2340 
igfecactggttc ctgactatgc aatgtcattc atcaagggca aatcactcaa gccgtttggg 2400 
Igcctgtggt gggacgagac agttcctaca gttgtaacca gagcagagcc tcacaaccag 2460 
S&tataattc atccgactca agcaagggtc ctcactatcc gggagaacgc aaggttacag 2520 
#§cttccccg attactaccg attgtttggc ccgatcaagg agaagtacat tcaagtcggg 2580 
aScgcagtgg ctgtccctgt tgcccgggca ctgggctact gtctggggca agcctacctg 2640 
lltgaatctg aggggagtga ccctctgtac cagctgcctc caagtttcac ctctgttgga 2700 
IJJacgcactg cggggcaggc gagggcctct cctgttggca cccctgcagg ggaggtagtt 2760 
gagcagtaaa aggatgacag atctgagctg agctgg 2796 



<210> 3 
<211> 912 
<212> PRT 
<213> Zea mays 

<400> 3 

Met Ala Pro Ser Ser Pro Ser Pro Ala Ala Pro Thr Arg Val Ser Gly 
i 5 10 15 

Arg Lys Arg Ala Ala Lys Ala Glu Glu He His Gin Asn Lys Glu Glu 
20 25 30 

Glu Glu Glu Val Gly Ala Ala Ala Ser Ser Ala Lys Arg Ser Arg Lys 
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35 40 45 

Ala Ala Ser Ser Gly Lys Lys Pro Lys Ser Pro Pro Lys Gin Ala Lys 
50 55 60 

Pro Gly Lys Lys Lys Gly Asp Ala Glu Met Lys Glu Pro Val Glu Asp 
65 70 75 80 

Asp Val Cys Ala Glu Glu Pro Asp Glu Glu Glu Leu Ala Met Gly Glu 
85 90 95 

Glu Glu Ala Glu Glu Gin Ala Met Gin Glu Glu Val Val Ala Val Ala 

100 105 110 

Ala Gly Ser Pro Gly Lys Lys Arg Val Gly Arg Arg Asn Ala Ala Ala 
115 120 125 

Ala Ala Gly Asp His Glu Pro Glu Phe lie Gly Ser Pro Val Ala Ala 
O 130 135 140 

yBsp Glu Ala Arg Ser Asn Trp Pro Lys Arg Tyr Gly Arg Ser Thr Ala 

^5 150 155 160 

;|gLa Lys Lys Pro Asp Glu Glu Glu Glu Leu Lys Ala Arg Cys His Tyr 
165 170 175 

; '^rg Ser Ala Lys Val Asp Asn Val Val- Tyr Cys Leu Gly Asp Asp Val 
!*!? 180 185 190 

j§iyr Val Lys Ala Gly Glu Asn Glu Ala Asp Tyr lie Gly Arg lie Thr 
m 195 200 205 

Glu Phe Phe Glu Gly Thr Asp Gin Cys His Tyr Phe Thr Cys Arg Trp 
210 215 220 

Phe Phe Arg Ala Glu Asp Thr Val lie Asn Ser Leu Val Ser lie Ser 
225 230 235 240 

Val Asp Gly His Lys His Asp Pro Arg Arg Val Phe Leu Ser Glu Glu 
245 250 255 

Lys Asn Asp Asn Val Leu Asp Cys lie lie Ser Lys Val Lys lie Val 
260 265 270 

His Val Asp Pro Asn Met Asp Pro Lys Ala Lys Ala Gin Leu He Glu 
275 280 285 

Ser Cys Asp Leu Tyr Tyr Asp Met Ser Tyr Ser Val Ala Tyr Ser Thr 
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290 295 300 

Phe Ala Asn He Ser Ser Glu Asn Gly Gin Ser Gly Ser Asp Thr Ala 
305 310 315 320 

Ser Gly He Ser Ser Asp Asp Val Asp Leu Glu Thr Ser Ser Ser Met 
325 330 335 

Pro Thr Arg Thr Ala Thr Leu Leu Asp Leu Tyr Ser Gly Cys Gly Gly 
340 345 350 

Met Ser Thr Gly Leu Cys Leu Gly Ala Ala Leu Ser Gly Leu Lys Leu 
355 360 365 

Glu Thr Arg Trp Ala Val Asp Phe Asn Ser Phe Ala Cys Gin Ser Leu 
370 375 380 

Lys Tyr Asn His Pro Gin Thr Glu Val Arg Asn Glu Lys Ala Asp Glu 
|B5 390 395 400 

^Jie Leu Ala Leu Leu Lys Glu Trp Ala Val Leu Cys Lys Lys Tyr Val 
N : 405 410 415 

jgln Asp Val Asp Ser Asn Leu Ala Ser Ser Glu Asp Gin Ala Asp Glu 

M" 420 425 430 

J^sp Ser Pro Leu Asp Lys Asp Glu Phe Val Val Glu Lys Leu Val Gly 
435 440 445 

I^Le Cys Tyr Gly Gly Ser Asp Arg Glu Asn Gly He Tyr Phe Lys Val 
■v| 450 455 460 

II I" 

''3ln Trp Glu Gly Tyr Gly Pro Glu Glu Asp Thr Trp Glu Pro lie Asp 
465 470 475 480 

Asn Leu Ser Asp Cys Pro Gin Lys He Arg Glu Phe Val Gin Glu Gly 
485 490 495 

Kis Lys Arg Lys lie Leu Pro Leu Pro Gly Asp Val Asp Val He Cys 
500 505 510 

Gly Gly Pro Pro Cys Gin Gly lie Ser Gly Phe Asn Arg Tyr Arg Asn 
515 520 525 

Arg Asp Glu Pro Leu Lys Asp Glu Lys Asn Lys Gin Met Val Thr Phe 
530 535 540 

Met Asp lie Val Ala Tyr Leu Lys Pro Lys Tyr Val Leu Met Glu Asn 
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545 



550 555 560 



Val Val Asp He Leu Lys Phe Ala Asp Gly Tyr Leu Gly Lys Tyr Ala 
565 570 575 

Leu Ser Cys Leu Val Ala Met Lys Tyr Gin Ala Arg Leu Gly Met Met 
580 585 590 

Val Ala Gly Cys Tyr Gly Leu Pro Gin Phe Arg Met Arg Val Phe Leu 
595 600 605 

Trp Gly Ala Leu Ser Ser Met Val Leu Pro Lys Tyr Pro Leu Pro Thr 
610 €15 620 

Tyr Asp Val Val Val Arg Gly Gly Ala Pro Asn Ala Phe Ser Gin Cys 
625 630 635 640 

Mpt Val Ala Tyr Asp Glu Thr Gin Lys Pro Ser Leu Lys Lys Ala Leu 
^ 645 650 655 

%\ 
, 

jljeu Leu Gly Asp Ala He Ser Asp Leu Pro Lys Val Gin Asn His Gin 

*Z* 660 665 670 

i§ro Asn Asp Val Met Glu Tyr Gly Gly Ser Pro Lys Thr Glu Phe Gin 



675 680 685 



T Y r Ile Ar 9 Leu Ser Ar 9 Lys Asp Met Leu Asp Trp Ser Phe Gly 
690 695 700 

CBlu Gly Ala Gly Pro Asp Glu Gly Lys Leu Leu Asp His Gin Pro Leu 
©05 710 715 720 

m 

Arg Leu Asn Asn Asp Asp Tyr Glu Arg Val Gin Gin Ile Pro Val Lys 
725 730 735 

Lys Gly Ala Asn Phe Arg Asp Leu Lys Gly Val Arg Val Gly Ala Asn 
740 745 750 

Asn lie Val Glu Trp Asp Pro Glu Ile Glu Arg Val Lys Leu Ser Ser 
755 760 765 

Gly Lys Pro Leu Val Pro Asp Tyr Ala Met Ser Phe Ile Lys Gly Lys 
770 775 780 

Ser Leu Lys Pro Phe Gly Arg Leu Trp Trp Asp Glu Thr Val Pro Thr 
785 790 795 800 

Val Val Thr Arc Ala Glu Pro His Asn Gin Val Ile Ile His Pro Thr 
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805 810 815 

Gin Ala Arg Val Leu Thr lie Arg Glu Asn Ala Arg Leu Gin Gly Phe 
820 825 830 

Pro Asp Tyr Tyr Arg Leu Phe Gly Pro lie Lys Glu Lys Tyr lie Gin 
835 840 845 

Val Gly Asn Ala Val Ala Val Pro Val Ala Arg Ala Leu Gly Tyr Cys 
850" 855 860 

Leu Gly Gin Ala Tyr Leu Gly Glu Ser Glu Gly Ser Asp Pro Leu Tyr 
865 870 875 880 

Gin Leu Pro Pro Ser Phe Thr Ser Val Gly Gly Arg Thr Ala Gly Gin 
885 890 895 

Ala Arg Ala Ser Pro Val Gly Thr Pro Ala Gly Glu Val Val Glu Gin 

t:i 900 905 910 



1 C > 4 
^211> 922 
%212> PRT 
';^2 1 3 > Zea mays 

!l 400> 4 

:^Jrg Ala Ala Ala Ala Thr Ala Ala Pro Ala Met Ala Pro Ser Ser Pro 
j^f 15 10 15 

Ser Pro Ala Ala Pro Thr Arg Val Ser Gly Arg Lys Arg Ala Ala Lys 
20 25 30 

Ala Glu Glu lie Kis Gin Asn Lys Glu Glu Glu Glu Glu Val Ala Ala 
35 40 45 

Ala Ser Ser Ala Lys Arg Ser Arg Lys Ala Ala Ser Ser Gly Lys Lys 
50 55 60 

Pro Lys Ser Pro Pro Lys Gin Ala Lys Pro Gly Arg Lys Lys Lys Gly 
65 70 75 SO 

Asp Ala Glu Met Lys Glu Pro Val Glu Asp Asp Val Cys Ala Glu Glu 
85 90 95 
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Pro Asp Glu Glu Glu Leu Ala Met Gly Glu Glu Glu Ala Glu Glu Gin 
100 105 110 

Ala Met Gin Glu Glu Val Val Ala Val Ala Ala Gly Ser Pro Gly Lys 
115 120 125 

Lys Arg Val Gly Arg Arg Asn Ala Ala Ala Ala Ala Gly Asp His Glu 
130 135 140 

Pro Glu Phe He Gly Ser Pro Val Ala Ala Asp Glu Ala Arg Ser Asn 
145 150 155 160 

Trp Pro Lys Arg Tyr Gly Arg Ser Thr Ala Ala Lys Lys Pro Asp Glu 
165 170 175 

Glu Glu Glu Leu Lys Ala Arg Cys His Tyr Arg Ser Ala Lys Val Asp 
180 185 190 

J$|n Val Val Tyr Cys Leu Gly Asp Asp Val Tyr Val Lys Ala Gly Glu 
195 200 205 

4sn Glu Ala Asp Tyr He Gly Arg He Thr Glu Phe Phe Glu Gly Thr 
210 215 220 

^? 

^sp Gin Cys His Tyr Phe Thr Cys Arg Trp Phe Phe Arg Ala Glu Asp 
£25 230 235 240 

i^ir Val He Asn Ser Leu Val Ser lie Ser Val Asp Gly His Lys His 
245 250 255 

m 

||jfep Pro Arg Arg Val Phe Leu Ser Glu Glu Lys Asn Asp Asn Val Leu 
l|j 260 265 270 

Asp Cys He He Ser Lys Val Lys He Val His Val Asp Pro Asn Met 
275 280 285 

Asp Pro Lys Ala Lys Ala Gin Leu He Glu Ser Cys Asp Leu Tyr Tyr 
290 295 300 

Asp Met Ser Tyr Ser Val Ala Tyr Ser Thr Phe Ala Asn He Ser Ser 
305 310 315 320 

Glu Asn Gly Gin Ser Gly Ser Asp Thr Ala Ser Gly He Ser Ser Asp 
325 330 335 

Asp Val Asp Leu Glu Thr Ser Ser Ser Met Pro Thr Arg Thr Ala Thr 
340 345 350 
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Leu Leu Asp Leu Tyr Ser Gly Cys Gly Gly Met Ser Thr Giy Leu Cys 
355 360 365 

Leu Gly Ala Ala Leu Ser Gly Leu Lys Leu Glu Thr Arg Trp Ala Val 
370 375 380 

Asp Phe Asn Ser Phe Ala Cys Gin Ser Leu Lys Tyr Asn His Pro Gin 
385 390 395 400 

Thr Glu Val Arg Asn Glu Lys Ala Asp Glu Phe Leu Ala Leu Leu Lys 
405 410 415 

Glu Trp Ala Val Leu Cys Lys Lys Tyr Val Gin Asp Val Asp Ser Asn 
420 425 430 

Leu Ala Ser Ser Glu Asp Gin Ala Asp Glu Asp Ser Pro Leu Asp Lys 
435 440 445 

Glu Phe Val Val Glu Lys Leu Val Gly lie Cys Tyr Gly Gly Ser 

[% 450 455 460 

jfESp Arg Glu Asn Gly lie Tyr Phe Lys Val Gin Trp Glu Gly Tyr Gly 

im55 470 475 480 

IBzo Glu Glu Asp Thr Trp Glu Pro He Asp Asn Leu Ser Asp Cys Pro 
^ 485 490 495 

i&ln Lys He Arg Glu Phe Val Gin Glu Gly His Lys Arg Lys He Leu 
500 505 510 

'-. 

Lpro Leu Pro Gly Asp Val Asp Val lie Cys Gly Gly Pro Pro Cys Gin 
OJ 515 520 525 

Gly He Ser Gly Phe Asn Arg Tyr Arg Asn Arg Asp Glu Pro Leu Lys 
530 535 * 540 

Asp Glu Lys Asn Lys Gin Met Val Thr Phe Met Asp He Val Ala Tyr 
545 550 555 560 

Leu Lys Pro Lys Tyr Val Leu Men Glu Asn Val Val Asp lie Leu Lys 
565 570 575 

Phe Ala Asp Gly Tyr Leu Gly Lys Tyr Ala Leu Ser Cys Leu Val Ala 
580 585 590 

Met Lys Tyr Gin Ala Arg Leu Gly Met Met Val Ala Gly Cys Tyr Gly 
595 600 605 
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Leu Pro Gin Phe Arg Met Arg Val Phe Leu Trp Gly Ala Leu Ser Ser 
610 615 620 

Met Val Leu Pro Lys Tyr Pro Leu Pro Thr Tyr Asp Val Val Val Arg 
625 630 " 635 640 

Gly Gly Ala Pro Asn Ala Phe Ser Gin Cys Met Val Ala Tyr Asp Glu 
645 650 655 

Thr Gin Lys Pro Ser Leu Lys Lys Ala Leu Leu Leu Gly Asp Ala He 
660 665 670 

Ser Asp Leu Pro Lys Val Gin Asn His Gin Pro Asn A>sp Val Met Glu 
675 680 685 

Tyr Gly Gly Ser Pro Lys Thr Glu Phe Gin Arg Tyr He Arg Leu Ser 
690 695 700 

||g Lys Asp Met Leu Asp Trp Ser Phe Gly Glu Gly Ala Gly Pro Asp 
J 5 710 715 720 

<3Ju Gly Lys Leu Leu Asp His Gin Pro Leu Arg Leu Asn Asn Asp Asp 
m. 725 730 735 

fefrr Glu Arg Val Gin Gin He Pro Val Lys Lys Gly Ala Asn Phe Arg 
s£ 74C 745 750 

ksp Leu Lys Gly Val Arg Val Gly Ala Asn Asn He Val Glu Trp Asp 
hh 755 760 765 

Bio Glu He Glu Arg Val Lys Leu Ser Ser Gly Lys Pro Leu Val Pro 

lli 770 775 780 

Asp Tyr Ala Met Ser Phe He Lys Gly Lys Ser Leu Lys Pro Phe Gly 

785 790 795 800 

Arg Leu Trp Trp Asp Glu Thr Val Pro Thr Val Val Thr Arg Ala Glu 
805 810 815 

Pro His Asn Gin Val He He His Pro Thr Gin Ala Arg Val Leu Thr 
820 825 830 

He Arg Glu Asn Ala Arg Leu Gin Gly Phe Pro Asp Tyr Tyr Arg Leu 
835 840 845 

Phe Gly Pro He Lys Glu Lys Tyr He Gin Val Gly Asn Ala- Val Ala 
850 855 860 
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Val Pro Val Ala 
865 

Gly Glu Ser Glu 



Thr Ser Val Gly 
900 

Gly Thr Pro Ala 
915 



Arg Ala Leu Gly 
870 

Gly Ser Asp Pro 
885 

Gly Arg Thr Ala 



Gly Glu Val Val 
920 



Tyr Cys Leu Gly 
875 

Leu Tyr Gin Leu 
890 

Gly Gin Ala Arg 
905 

Glu Gin 



Gin Ala Tyr Leu 
880 

Pro Pro Ser Phe 
895 

Ala Ser Pro Val 
910 



<210> 5 
<211> 9 
<212> PRT 
<213> Zea mays 

y°°> 5 

ftfs Asp Asp Arg Ser Glu Leu Ser Trp 
■4^ i 5 

||l0> 6 

||ll> 27 

<212> DNA 

Si&13> Artificial Sequence 

j|?23> Description of Artificial Sequence: This sequence 

m was artificially synthesized based on the sequence 

flj of Zea mays . 

<400> 6 

tggttgctat ggtctgccac agttcag 



<210> 7 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 

<400> 7 
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ccagctcagc tcagatctgt catccttt 

<210> 8 
<211> 24 
<2i2> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 

<400-> 8 

cgaaagctaa tctacacaaa cage 



<:210> 9 
^|ll> 24 
1jl2> DNA 

if%13> Artificial Sequence 

m 220> 

;§223> Description of Artificial Sequence: This sequence 
(J. was artificially synthesized based on the sequence 

of Zea mays . 

f^4 G0> 9 

l^atcctctga gettgetaaa tttg 



|}<f210> 10 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays. - 

<400> 10 

ctcatcttqg agtggctcat cac 



<210> 11 
<211> 22 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays. 

<400> 11 

gagcacatga gggagagtgt tg 



<210> 12 
<211> 21 
<212> DNA. 

<213> Artificial Sequence 
<220> 

&&23> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays. 

: p00> 12 

i^fctctaattt tctgcgggca g 

hit 

^<210> 13 

^bll> 24 

N 5 212> DNA 

H ? 213> Artificial Sequence 

m 

"■4 '• 

M2 2 0 > 

= 1*223 > Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 

<400> 13 

cctctgccca cctatgatgt tgta 



<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 



13 



<400> 14 

taaagggcgt gagggttgga 



<210> 15 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 



<2ZU> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 



of Zea mays . 
<400> 15 

tcacatttgt catggcaggt tatc 



,^210> 16 

=4-211> 24 

S J#212> DNA 

fel3> Artificial Sequence 

^? 

j#2 2 0 > 

,;.<223> Description of Artificial Sequence: This sequence 

m was artificially synthesized based on the sequence 

M J of Zea mays . 

!p400> 16 

ffitgaggaaaa gaacgacaat gtgc 



<210> 17 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 

<400> 17 

gcaatcaagc acattgtcgt tcttttcctc 



<210> 18 
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<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 

<400> 18 

gaagaagagg gtggggagaa ggaacg 



<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
^22 0> 



%<223> Description of Artificial Sequence: This sequence 
^ was artificially synthesized based on the sequence 

of Zea mays . 



^40 0 > 19 

il^fctctttgcgg cagtgctgcg 



y<:210> 20 
i^<211> 28 

f^212> DNA 

"■ '4 : 

;S*ic213> Artificial Sequence 

ill 

' V <220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 

<400> 20 

gtattgaatt gattctcaac tagtgcac 



<210> 21 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial _ Sequence : This sequence 

15 



was artificially synthesized based on the sequence 
of Zea mays. 

<400> 21 

caggctcaac ggcgatg 

<210> 22 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 

<400> 22 

Qiatgcttcat cacatagacc caagtc 

Nf-210> 2 3 

;fi:211> 2 8 

^212 > DNA 

N'213> Artificial Sequence 
l U220> 

^223 > Description of Artificial Sequence: This sequence 

T. : was artificially synthesized based on the sequence 

\yi of Zea mays . 

=^4 00> 2 3 

'gatagaccta atgccaaatg agattaag 

<210> 24 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 

<400> 24 

gcgatcttca gtctccacca tc 
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<210> 25 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 

<400> 25 

gaagacgtgc ctccatgttt catc 



<210> 26 
<211> 21 
<212> DNA 
M213> Artificial Sequence 

;I4 2 2 o > 

-^<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 

^<:400> 26 

; ¥ gttggttctt ccgagcagag g 

hA 
h& 

:^f210> 2 7 

fi^211> 25 

2 1 2 > DNA 
"<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 

<40Q> 27 

gactgccaca tatcttatta atcgc 



<210> 28 

<;211> 26 

<2I2> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence; This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 

<400> 28 

gcatgtgtca gcaattgctt acattc 



<210> 29 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 

,1X:400> 23 
S M 

=.;octctgctcg gaagaaccaa c 



i?k210> 3 0 
y5c211> 24 
5S <212> DNA 

lj*|c213> Artificial Sequence 

M ! 

M< 2 2 0 > 

|J"<223> Description of Artificial Sequence: This sequence 

was artificially synthesized based on the sequence 
iy of Zea mays. 

<400> 30 

ctgttcggag attcatgcat gatg 



<210> 31 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays. 

<400> 31 
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ggagaacaga atggttgatt caatgg 



<210> 32 
<211> 23 
<2I2> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays. 

<400> 32 

gcacttcact ctcctggcaa acc 



<210> 33 

H*<211> 21 

*k 2 12 > DNA 

"H:213> Artificial Sequence 



^22 3 > Description of Artificial Sequence: This sequence 
T 7 f was artificially synthesized based on the sequence 

of Zea mays . 



\<400> 33 



^cggtacgctg ctgctgctct c 

ff^210> 34 
' "<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays. 

<400> 34 

ccatagcatc tcacatatcg caagg 



<210> 35 
<2il> 28 
<212> DNA 
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<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially -synthesized based on the sequence 
of Zea mays . 

<400> 35 

ggaaagaagg cagttagttg taaatggg 



<210> 36 

<211> 32 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 



-<400> 36 

'Jjfgagaagcca acgccawcgc ctcyatttcg tc 

VT <210> 37 
■^21 1> 2 5 
T^212> DNA 

\ <213> Artificial Sequence 
2 2 0 > 

pf<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 

<400> 37 

ctacaacatc atagttgggc agagg 



<210> 


38 


<211> 


23 


<212> 


DNA 


<213> 


Art i f icial 


<220> 





<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 
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<400> 38 

actcactata gggctcgagc ggc 



<210> 39 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays. 

<400> 39 

taatacgact cactataggg 



$4210 > 4 0 
: I4211> 19 
: !|212> DNA 

'||213> Artificial Sequence 

m 

! s §220> 

; '<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 



Zea mays . 



^400> 40 

;%atttaggtg acactatag 



<210> 41 
<211> 17 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 



<400> 41 

gttttcccag tcacgac 



<210> 42 
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# 



<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: This sequence 
was artificially synthesized based on the sequence 
of Zea mays . 

<400> 42 

caggaaacag ctatgac 



<210> 43 
<211> 912 
<212> PRT 
<213> Zea mays 



Ck400> 43 



Cket Ala Pro Ser Ser Pro Ser Pro Ala Ala Pro Thr Arg Val Ser Gly 

p 15 10 15 

5 l*krg Lys Arg Ala Ala Lys Ala Glu Glu lie His Gin Asn Lys Glu Glu 
Q 2 0 2 5 3 0 

-^Glu Glu Glu Val Ala Ala Ala Ser Ser Ala Lys Arg Ser Arg Lys Ala 
: \ 35 40 45 



r,.Aia Ser Ser Gly Lys Lys Pro Lys Ser Pro Pro Lys Gin Ala Lys Pro 

!£: 50 55 SO 

^ ? fely Arg Lys Lys Lys Gly Asp Ala Glu Met Lys Glu Pro Val Glu Asp 

' 65 70 75 80 

Asp Val Cys Ala Glu Glu Pro Asp Glu Glu Glu Leu Ala Met Gly Glu 
85 90 95 

Glu Glu Ala Glu Glu Gin Ala Met Gin Glu Glu Val Val Ala Val Ala 
100 105 110 

Ala Gly Ser Pro Gly Lys Lys Arg Val Gly Arg Arg Asn Ala Ala Ala 
115 120 125 

Ala Ala Gly Asp His Glu Pro Glu Phe lie Gly Ser Pro Val Ala Ala 
130 135 140 

Asp Glu Ala Arg Ser Asn Trp Pro Lys Arg Tyr Gly Arg Ser Thr Ala 
145 150 155 160 
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Ala Lys Lys Pro Asp Glu Glu Glu Glu Leu Lys Ala Arg Cys His Tyr 
165 170 175 

Arg Ser Ala Lys Val Asp Asn Val Val Tyr Cys Leu Gly Asp Asp Val 
180 185 190 

Tyr Tyr Lys Ala Gly Glu Asn Glu Ala Asp Tyr He Gly Arg He Thr 
195 200 205 

Glu Phe Phe Glu Gly Thr Asp Gin Cys His Tyr Phe Thr Cys Arg Trp 
210 215 220 

Phe Phe Arg Ala Glu Asp Thr Val He Asn Ser Leu Val Ser He Ser 
225 230 235 240 

Val Asp Gly His Lys His Asp Pro Arg Arg Val Phe Leu Ser Glu Glu 
245 250 255 

'%ys Asn Asp Asn Val Leu Asp Cys lie lie Ser Lys Val Lys He Val 
^ 260 265 270 

lift is Val Asp Pro Asn Met Asp Pro Lys Ala Lys Ala Gin Leu He Glu 
275 280 285 

I'ser Cys Asp Leu Tyr Tyr Asp Met Ser Tyr Ser Val Ala Tyr Ser Thr 
m 2 9 0 295 300 

i^he Ala Asn He Ser Ser Glu Asn Gly Gin Ser Gly Ser Asp Thr Ala 
m05 310 315 320 

lifer Gly He Ser Ser Asp Asp Val Asp Leu Glu Thr Ser Ser Ser Met 
325 330 335 

Pro Thr Arg Thr Ala Thr Leu Leu Asp Leu Tyr Ser Gly Cys Gly Gly 
340 345 350 

Met Ser Thr Gly Leu Cys Leu Gly Ala Ala Leu Ser Gly Leu Lys Leu 
355 360 365 

Glu Thr Arg Trp Ala Val Asp Phe Asn Ser Phe Ala Cys Gin Ser Leu 
370 375 380 

Lys Tyr Asn His Pro Gin Thr Glu Val Arg Asn Glu Lys Ala Asp Glu 
385 390 395 400 

Phe Leu Ala Leu Leu Lys Glu Trp Ala Val Leu Cys Lys Lys Tyr Val 
405 410 415 
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Gin Asp Val Asp Ser Asn Leu Ala Ser Ser Glu Asp Gin Ala Asp Glu 
420 425 430 

Asp Ser Pro Leu Asp Lys Asp Glu Phe Val Val Glu Lys Leu Val Gly 
435 440 445 

lie Cys Tyr Gly Gly Ser Asp Arg Glu Asn Gly lie Tyr Phe Lys Val 
450 455 460 

Gin Trp Glu Gly Tyr Gly Pro Glu Glu Asp Thr Trp Glu Pro He Asp 
465 470 475 480 

Asn Leu Ser Asp Cys Pro Gin Lys He Arg Glu Phe Val Gin Glu Gly 
485 490 495 

His Lys Arg Lys He Leu Pro Leu Pro Gly Asp Val Asp Val He Cys 
500 505 510 

*|3ly Gly Pro Pro Cys Gin Gly He Ser Gly Phe Asn Arg Tyr Arg Asn 
515 520 525 

! fkrg Asp Glu Pro Leu Lys Asp Glu Lys Asn Lys Gin Met Val Thr Phe 
© 530 535 540 

r *Met Asp He Val Ala Tyr Leu Lys Pro Lys Tyr Val Leu Met Glu Asn 

^45 550 * 555 560 

! yal Val Asp He Leu Lys Phe Ala Asp Gly Tyr Leu Gly Lys Tyr Ala 

!j 565 570 575 

Hi 

m 

Sleu Ser Cys Leu Val Ala Met Lys Tyr Gin Ala Arg Leu Gly Met Met 
580 585 590 

Val Ala Gly Cys Tyr Gly Leu Pro Gin Phe' Arg Met Arg Val Phe Leu 
595 600 605 

Trp Gly Ala Leu Ser Ser Met Val Leu Pro Lys Tyr Pro Leu Pro Thr 
610 615 620 

Tyr Asp Val Val Val Arg Gly Gly Ala Pro Asn Ala Phe Ser Gin Cys 
625 630 635 640 

Met Val Ala Tyr Asp Glu Thr Gin Lys Pro Ser Leu Lys Lys Ala Leu 
645 650 655 

Leu Leu Gly Asp Ala lie Ser Asp Leu Pro Lys Val Gin Asn His Gin 
660 665 670 
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• 



Pro Asn Asp Val Met Glu Tyr Gly Gly Ser Pro Lys Thr Glu Phe Gin 
675 680 685 

Arg Tyr He Arg Leu Ser Arg Lys Asp Met Leu Asp Trp Ser Phe Gly 
690 695 700 

Glu Gly Ala Gly Pro Asp Glu Gly Lys Leu Leu Asp His Gin Pro Leu 
705 710 715 720 

Arg Leu Asn Asn Asp Asp Tyr Glu Arg Val Gin Gin He Pro Val Lys 
725 730 735 

Lys Gly Ala Asn Phe Arg Asp Leu Lys Gly Val Arg Val Gly Ala Asn 
740 745 750 

Asn He Val Glu Trp Asp Pro Glu He Glu Arg Val Lys Leu Ser Ser 
m 755 760 765 

Lys Pro Leu Val Pro Asp Tyr Ala Met Ser Phe He Lys Gly Lys 
M*. 770 775 780 

!per Leu Lys Pro Phe Gly Arg Leu Trp Trp Asp Glu Thr Val Pro Thr 

©85 790 795 800 

; Val Val Thr Arg Ala Glu Pro His Asn Gin Val He He His Pro Thr 



805 810 815 



tSln Ala Arg Val Leu Thr Leu Arg Glu Asn Ala Arg Leu Gin Gly Phe 
J; 820 825 830 

"¥ro Asp Tyr Tyr Arg Leu Phe Gly Pro He Lys Glu Lys Tyr lie Gin 
835 840 845 

Val Gly Asn Ala Val Ala Val Pro Val Ala Arg Ala Leu Gly Tyr Cys 
850 855 860 

Leu Gly Gin Ala Tyr Leu Gly Glu Ser Glu Gly Ser Asp Pro Leu Tyr 
865 870 875 880 

Gin Leu Pro Pro Ser Phe Thr Ser Val Gly Gly Arg Thr Ala Gly Gin 
885 890 895 

Ala Arg Ala Ser Pro Val Gly Thr Pro Ala Gly Glu Val Val Glu Gin 
900 905 910 
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<210> 44 

<211> 791 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 44 

Met Ala Ala Arg Asn Lys Gin Lys Lys Arg Ala Glu Pro Glu Ser isp 
1 5 10 15 

Leu Cys Phe Ala Gly Lys Pro Met Ser Val Val Glu Ser Thr He \rg 
20 25 30 

Trp Pro His Arg Tyr Gin Ser Lys Lys Thr Lys Leu Gin Ala Pro Thr 
35 40 45 

^ys Lys Pro Ala Asn Lys Gly Gly Lys Lys Glu Asp Glu Glu He He 
50 55 60 

rtys Gin Ala Lys Cys His Phe Asp Lys Ala Leu Val Asp Gly Val Leu 
K 6 5 70 75 80 

Sile Asn Leu Asn Asp Asp Val Tyr Val Thr Gly Leu Pro Gly Lys Leu 



85 



95 



ppys Phe He Ala Lys Val He Glu Leu Phe Glu Ala Asp Asp Gly Val 
J 100 105 HO 

plpro Tyr Cys Arg Phe Arg Trp Tyr Tyr Arg Pro Glu Asp Thr Leu He 
h 115 120 125 

Glu Arg Phe Ser His Leu Val Gin Pro Lys Arg Val Phe Leu Ser Asn 
130 135 140 

Asp Glu Asn Asp Asn Pro Leu Thr Cys He Trp Ser Lys Val Asn He 
145 150 155 160 

Ala Lys Val Pro Leu Pro Lys He Thr Ser Arg He Glu Gin Arg Val 
165 170 175 

He Pro Pro Cys Asp Tyr Tyr Tyr Asp Met Lys Tyr Glu Val Pro Tyr 
180 185 190 

Leu Asn Phe Thr Ser Ala Asp Asp Gly Ser Asp Ala Ser Ser Ser Leu 
195 200 205 

Ser Ser Asp Ser Ala Leu Asn Cys Phe Glu Asn Leu His Lys Asp Glu 
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210 215 220 

Lys Phe Leu Leu Asp Leu Tyr Ser Gly Cys Gly Ala Met Ser Thr Gly 
225 230 235 240 

Phe Cys Met Gly Ala Ser lie Ser Gly Val Lys Leu lie Thr Lys Trp 
245 250 255 

Ser Val Asp lie Asn Lys Phe Ala Cys Asp Ser Leu Lys Leu Asn His 
260 265 270 

Pro Glut Thr Glu Val Arg Asn Glu Ala Ala Glu Asp Phe Leu Ala Leu 
275 280 285 

Leu Lys Glu Trp Lys Arg Leu Cys Glu Lys Phe Ser Leu Val Ser Ser 
290 295 300 

.^Thr Glu Pro Val Glu Ser He Ser Glu Leu Glu Asp Glu Glu Val Glu 
;j$05 310 315 320 

m 

iplu. Asn Asp Asp He Asp Glu Ala Ser Thr Gly Ala Glu Leu Glu Pro 
325 330 335 

-Mi 

ffply Glu Phe Glu Val Glu Lys Phe Leu Gly He Met Phe Gly Asp Pro 
S 340 345 350 

pJ5ln Gly Thr Gly Glu Lys Thr Leu Gin Leu Met Val Arg Trp Lys Gly 
hk 355 360 365 

(pTyr Asn Ser Ser Tyr Asp Thr Trp Glu Pro Tyr Ser Gly Leu Gly Asn 
Q 370 375 380 

m 

Cys Lys Glu Lys Leu Lys Glu Tyr Val He Asp Gly Phe Lys Ser His 
385 390 395 400 

Leu Leu Pro Leu Pro Gly Thr Val Tyr Thr Val Cys Gly Gly Pro Pro 
405 410 415 

Cys Gin Gly He Ser Gly Tyr Asn Arg Tyr Arg Asn Asn Glu Ala Pro 
420 425 430 

Leu Glu Asp Gin Lys Asn Gin Gin Leu Leu Val Phe Leu Asp He lie 
435 440 445 

Asp Phe Leu Lys Pro Asn Tyr Val Leu Met Glu Asn Val Val Asp Leu 
450 455 460 

Leu Arg Phe Ser Lys Gly Phe Leu Ala Arg His Ala Val Ala Ser Phe 
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465 



470 475 480 



Val Ala Met Asn Tyr Gin Thr Arg Leu Gly Met Met Ala Ala Gly Ser 
485 490 495 

Tyr Gly Leu Pro Gin Leu Arg Asn Arg Val Phe Leu Trp Ala Ala Gin 
500 505 510 

Pro Ser Glu Lys Leu Pro Pro Tyr Pro Leu Pro Thr His Glu Val Ala 
515 520 525 

Lys Lys Phe Asn Thr Pro Lys Glu Phe Lys Asp Leu Gin Val Gly Arg 
530 535 540 

lie Gin Met Glu Phe Leu Lys Leu Asp Asn Ala Leu Thr Leu Ala Asp 
545 550 555 560 

a&la He Ser Asp Leu Pro Pro Val Thr Asn Tyr Val Ala Asn Asp Val 
% 565 570 575 

;Jet Asp Tyr Asn Asp Ala Ala Pro Lys Thr Glu Phe Glu Asn Phe He 
V 580 585 590 

ffjper Leu Lys Arg Ser Glu Thr Leu Leu Pro Ala Cys Gly Gly Asp Pro 
595 600 605 

^!thr Arg Arg Leu Phe Asp His Gin Pro Leu Val Leu Gly Asp Asp Asp 
hi* 610 615 620 

Qleu Glu Arg Val Ser Tyr lie Pro Lys Gin Lys Gly Ala Asn Tyr Arg 

Cb25 630 635 640 

l S } 

Asp Met Pro Gly Val Leu Val His Asn Asn Lys Ala Glu He Asn Pro 
645 650 655 

Arg Phe Arg Ala Lys Leu Lys Ser Gly Lys Asn Val Val Pro Ala Tyr 
660 665 670 

Ala lie Ser Phe He Lys Gly Lys Ser Lys Lys Pro Phe Gly Arg Leu 
675 680 685 

Trp Gly Asp Glu He Val Asn Thr Val Val Thr Arg Ala Glu Pro His 
690 695 700 

Asn Gin Cys Val He His Pro Met Gin Asn Arg Val Leu Ser Val Arg 
705 710 715 720 

Glu Asn Ala Arg Leu Gin Gly Phe Pro Asp Cys Tyr Lys Leu Cys Gly 
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725 730 735 

Thr lie Lys Glu Lys Tyr lie Gin Val Gly Asn Ala Val Ala Val Pro 
740 745 750 

Val Gly Val Ala Leu Gly Tyr Ala Phe Gly Met Ala Ser Gin Gly Leu 
755 760 765 

Thr Asp Asp Glu Pro Val lie Lys Leu Pro Phe Lys Tyr Pro Glu Cys 
770 775 780 

Met Gin Ala Lys Asp Gin lie 
785 790 



<210> 45 

<211> 444 

m&12 > PRT 

y?§213> Zea mays 

1=^400 > 4 5 

:|geu Asp He Phe Ala Gly Cys Gly Gly Leu Ser Glu Gly Leu Gin Gin 
B 15 10 15 

Wila Gly Val Ser Phe Thr Lys Trp Ala He Glu Tyr Glu Glu Pro Ala 
20 25 30 

^Sly Glu Ala Phe Asn Lys Asn His Pro Glu Ala Val Val Phe Val Asp 
-35 4 0 4 5 

m 

I'Spisn Cys Asn Val He Leu Lys Ala He Met Asp Lys Cys Gly Asp Thr 
50 55 60 

Asp Asp Cys Val Ser Thr Ser Glu Ala Ala Glu Gin Ala Ala Lys Leu 
65 70 * 75 80 

Pro Glu Val Asn He Asn Asn Leu Pro Val Pro Gly Glu Val Glu Phe 
85 90 95 

He Asn Gly Gly Pro Pro Cys Gin Gly Phe Ser Gly Met Asn Arg Phe 
100 105 110 

Asn Cys Gin Ser Pro Trp Ser Lys Val Gin Cys Glu Met lie Leu Ala 
115 120 125 

Phe Leu Ser Phe Ala Glu Tyr Phe Arg Pro Arg Phe Phe Leu Leu Glu 
130 135 140 



29 



Asn Val Arg Asn Phe Val Ser Phe Asn Lys Gly Gin Thr Phe Arg Leu 
145 150 155 160 

Ala Val Ala Ser Leu Leu Glu Met Gly Tyr Gin Val Arg Phe Gly lie 
165 * 170 175 

Leu Glu Ala Gly Ala Phe Gly Val Ala Gin Ser Arg Lys Arg Ala Phe 
180 185 190 

lie Trp Ala Ala Ala Pro Gly Glu Met Leu Pro Asp Trp Pro Glu Pro 
195 200 205 

Met His Val Phe Ala Ser Pro Glu Leu Lys lie Thr Leu Pro Asp Gly 
210 215 220 

Gin Tyr Tyr Ala Ala Ala Arg Ser Thr Ala Gly Gly Ala Pro Phe Arg 
225 230 235 240 

M 

i|la He Thr Val Arg Asp Thr He Gly Asp Leu Pre Lys Val Gly Asn 
S 245 250 255 

;ply Ala Ser Lys Leu Thr Leu Glu Tyr Gly Gly Glu Pro Val Ser Trp 
O 260 265 270 

h&he Gin Lys Lys He Arg Gly Ser Met Met Val Leu Asn Asp His He 
* 275 280 285 

&ker Lys Glu Met Asn Glu Leu Asn Leu He Arg Cys Gin His lie Pro 
290 2 95 300 

m 

mi 

Ijpjys Arg Pro Gly Cys Asp Trp His Asp Leu Pro Asp Glu Lys Val Lys 
'!s05 310 315 320 

Leu Ser Asn Gly Gin Met Ala Asp Leu lie Pro Trp Cys Leu Pro Asn 
325 330 335 

Thr Ala Lys Arg His Asn Gin Trp Lys Gly Cys Leu Tyr Gly Arg Leu 
340 345 350 

Asp Trp Glu Gly Asn Phe Pro Thr Ser Val Thr Asp Pro Gin Pro Met 
355 360 365 

Gly Lys Val Gly Met Cys Phe His Pro Asp Gin Asp Arg He He Thr 
370 375 380 

Val Arg Glu Cys Ala Arg Ser Gin Gly Phe Pro Asp Ser Tyr Glu Phe 
385 390 ' 395 40C 



30 



Ala Gly Asn He Gl-n Asn Lys His Arg Gin He Gly Asn Ala Val Pro 
405 410 415 

Pro Pro Leu Ala Tyr Ala Leu Gly Arg Lys Leu Lys Glu Ala Val Asp 
420 425 430 

Lys Arg Gin Glu Ala Ser Ala Gly Val Pro Ala Pro 
435 440 



<210> 46 

<211> 440 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 46 

Leu Asp He Phe Ala Gly Cys Gly Gly Leu Ser His Gly Leu Lys Lys 
i^n i 5 10 15 

jUkla Gly Val Ser Asp Ala Lys Trp Ala He Glu Tyr Glu Glu Pro Ala 
M 20 25 30 

QGly Gin "Ala Phe Lys Gin Asn His Pro Glu Ser Thr Val Phe Val Asp 
0 35 40 45 

® Asn Cys Asn Val He Leu Arg Ala He Met Glu Lys Gly Gly Asp Gin 
p 50 55 60 

\%sp Asp Cys Val Ser Thr Thr Glu Ala Asn Glu Leu Ala Ala Lys Leu 

!|j 65 70 75 80 

1% I 

M ' ? Thr Glu Glu Gin Lys Ser Thr Leu Pro Leu Pro Gly Gin Val Asp Phe 

85 90 95 

He Asn Gly Gly Pro Pro Cys Gin Gly Phe Ser Gly Met Asn Arg Phe 
100 105 110 

Asn Cys Gin Ser Ser Trp Ser Lys Val Gin Cys Glu Met He Leu Ala 
115 120 125 

Phe Leu Ser Phe Ala Asp Tyr Phe Arg Pro Arg Tyr Phe Leu Leu Glu 
130 135 140 

Asn Val Arg Thr Phe Val Ser Phe Asn Lys Gly Gin Thr Phe Gin Leu 
145 150 155 160 

Thr Leu Ala Ser Leu Leu Glu Met Gly Tyr Gin Val Arg Phe Gly He 
165 170 175 
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Leu Glu Ala Giy Ala Tyr Gly Val Ser Gin Ser Arg Lys Arg Ala Phe 
180 185 190 

He Trp Ala Ala Ala Pro Glu Glu Val Leu Pro Glu Trp Pro Glu Pro 
195 200 205 

Met His Val Phe Gly Val Pro Lys Leu Lys He Ser Leu Ser Gin Gly 
210 ' 215 220 

Leu His Tyr Ala Ala Val Arg Ser Thr Ala Leu Gly Ala Pro Phe Arg 
225 230 235 240 

Pro He Thr Val Arg Asp Thr He Gly Asp Leu Pro Ser Val Glu Asn 
245 250 255 

Gly Asp Ser Arg Thr Asn Lys Glu Tyr Lys Glu Val Ala Val Ser Trp 
Q 260 265 270 

k ibhe Gin Lys Glu He Arg Gly Asn Thr He Ala Leu Thr Asp His He 
h k 275 280 285 

J!: 

:?bys Lys Ala Met Asn Glu Leu Asn Leu He Arg Cys Lys Leu He Pro 
290 295 300 

I^JThr Arg Pro Gly Ala Asp Trp His Asp Leu Pro Lys Arg Lys Val Thr 
$%05 310 315 320 

J^Leu Ser Asp Gly Arg Val Glu Glu Met He Pro Phe Cys Leu Pro Asn 
J!! 325 330 335 

: 'Thr Ala Giu Arg His Asn Gly Trp Lys Gly Leu Tyr Gly Arg Leu Asp 
340 345 350 

Trp Gin Gly Asn Phe Pro Thr Ser Val Thr Asp Pro Gin Pro Met Giy 
355 360 365 

Lys Val Gly Met Cys Phe His Pro Glu Gin His Arg He Leu Thr Val 
370 375 380 

Arg Giu Cys Ala Arg Ser Gin Gly Phe Pro Asp Ser Tyr Glu Phe Ala 
385 390 395 400 

Giy Asn lie Asn His Lys His Arg Gin lie Gly Asn Ala Val Pro Pro 
405 410 415 

Pro Leu Ala Phe Ala Leu Gly Arg Lys Leu Lys Giu Ala Leu His Leu 
420 425 430 
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Lys Lys Ser Pro Gin His Gin Pro 
435 440 



<210> 47 
<211> 130 
<212> DNA 
<213> Zea mays 

<400> 47 

catgctgttg ggccatgugt ctagtgttgg cccattaacg tgtacacata tactagaagt 6 
gtgtgtggtg tagagagagt gctgtatgtt ttccacattc cagaaaaatc cacatggtat 1 
cagagccagg 1 



,^ ; 210> 48 
H§fell> 123 
1|212 > DNA 



*0 

, ? <213> Zea mays 
I** 



;||4 0 0 > 4 8 

!l|agggggagt gttgggccat gtgtctagtg ttggcccatt aacgtgtaca catatactag 6 
jjfagtgtgtgt ggtgtagaga gagtgctgta tgttttccac attccagaaa aatccacaca 1 




49 



14 



FRT 



Zea mays 



<40C> 49 

Cys Tyr Asn Cys Gly Asn Val Gly His He Ala Arg Asn Cys 
15 10 



<210> 50 



<211 



> 17 



<212> PRT 



<213> Zea mays 



<400> 50 

Thr Gin Val Thr Gin Leu Lys Trp He Leu Asp Ser Gly Ala Ser Lys 
1 5 10 15 



<210> 51 
<211> 14 
<212> PRT 
<213> Zea mays 

<400> 51 

Cys Gin Val Cys Ser Arg Val Gly His Thr Ala Leu Asn Cys 
15 10 



<210> 52 

<211> 17 

<212> PRT 

<213> Zea mays 

iW4 00> 52 

Jpin Asn Gly Ser Asn Val Pro Trp Tyr Thr Asp Thr Gly Ala Thr Asp 
1 5 10 15 

j:His 



& <210> 53 

\Q:2 11> 14 

Mk212> PRT 

l^<213> Oryza sativa 

Qk4 00> 5 3 

% 

MCys Gin Val Cys Phe Lys Arg Gly His Thr Ala Ala Asp Cys 
15 10 



<210> 54 
<211> 17 
<212> PRT 

<213> Cryza sativa 
<400> 54 

Ser Tyr Gly lie Asp Thr Asn Trp Tyr lie Asp Thr Gly Ala Thr Asp 
15 10 15 

His 
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<210> 55 
<211> 14 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 55 

Cys Ser Asn Cys Gly Arg Thr Gly His Glu Lys Lys Glu Cys 
15 10 



<210> 56 
<211> 17 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 56 

Gly Lys Thr Lys Leu Gly Asp lie lie Leu Asp Ser Gly Ala Ser His 
m l 5 10 15 

*£? 

|#211> 14 
;.;<212> PRT 
j|pi3> Zea mays 

hh 

1*400 > 57 

0ys His His Cys Gly Arg Glu Gly His He Lys Lys Asp Cys 
M 1 5 10 

<210> 58 
<211> 17 
<212> PRT 

<213> Drosophila melanogaster 
<400> 58 

Ser Val Met Asp Asn Cys Gly Phe Val Leu Asp Ser Gly Ala Ser Asp 
15 10 15 

His 



<210> 59 
<211> 52 
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<212> PRT 
<213> Zea mays 

<400> 59 

Gin Val Lys He Leu Arg Pro Asp Asn Gly Thr Glu Tyr Val Asn Lys 
15 10 15 

Gly Phe Asn Ala Phe Leu Ser Arg Asn Gly He Leu His Gin Thr Ser 
20 25 30 

Cys Pro Asp Thr Pro Pro Gin Asn Gly Val Ala Glu Arg Lys Asn Arg 
35 40 45 

His He Leu Glu 
50 



J <210> 60 

'M&11> 50 

^212> PRT 

J^213> Zea mays 

.:!<0 0 C > 6 0 

3Sys lie He Ala Phe Gin Ser Asp Trp Gly Gly Glu Tyr Glu Lys Leu 
rf 15 10 15 

^sn Ala His Phe Lys Thr He Gly He His His Gin Val Ser Cys Pro 
2 0 2 5 3 0 

fflis Thr His Gin Gin Asn Gly Ala Ala Glu Arg Lys His Arg His He 
35 4 0 4 5 

H ; 

; 

Val Glu 
50 



<210> 61 
<211> 51 
<212> PRT 

<213> Oryza sativa 
<400> 61 

Lys He He Ala Met Gin Thr Asp Trp Arg Gly Gly Arg Tyr Gin Lys 
15 10 15 

Leu Asn Ser Phe Phe Ala Gin He Gly Leu He He Met Cys His Val 
20 25 30 
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Leu Thr Leu He Arg Gin Asn Gly Ser Ala Glu Arg Lys His Arg Kis 
. 35 40 45 

He Val Glu 
50 



<210> 62 
<211> 50. 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 62 

Thr Val Lys Met Val Arg Ser Asp Asn Gly Thr Glu Phe Met Cys Leu 
15 10 15 

Ser Ser Tyr Phe Arg Glu Asn Gly He He His Gin Thr Ser Cys Val 
20 25 30 

hf 

Igly Thr Pro Gin Gin Asn Gly Arg Val Glu Arg Lys His Arg His He 
[Cl 35 4 0 4 5 



,eu Asn 
50 



s**210> 63 
j^211> 52 
y<212> PRT 

£1*k213> Drosophiia melanogaster 

fyc400> 63 

'^Lys Val Val Tyr Leu Tyr He Asp Asn Gly Arg Glu Tyr Leu Ser Asn 
15 10 15 

Glu Met Arg Gin Phe Cys Val Lys Lys Gly He Ser Tyr His Leu Thr 
20 25 30 

Val Pro His Thr Pro Gin Leu Asn Gly Val Ser Glu Arg Met He Arg 
35 40 45 

Thr lie Thr Glu 
50 



<210> 64 
<211> 71 
<212> PRT 
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<21-3> Zea mays 
<400> 64 

Arg Tyr Lys Ala Arg Leu Val Ala Arg Gly Tyr Ser Gin Thr Tyr Gly 
15 10 15 

lie Asp Tyr Asp Glu Thr Phe Ala Pro Val Ala Lys Met Ser Thr Val 
20 25 30 

Arg Thr Leu lie Ser Cys Ala Ala Asn Phe Gly Trp Pro Leu Tyr Gin 
35 40 45 

Leu Asp Val Lys Asn Ala Phe Leu His Gly Asp Leu Gin Glu Glu Val 
50 55 60 

Tyr Met Glu lie Pro Pro Gly 
65 70 



<2«> 65 

<25€> 12 

<2^2> PRT 

<2J5> Zea mays 

<4|J> 6 5 

Ala He Leu Ala Val Tyr Val Asp Asp He He He 
jln 5 10 



<2m)> 6 6 



<2m.> 7i 

<2Xlt> PRT 
<213> Zea mays 



<400> 66 

Arg Leu Lys Ala Arg Leu Val Ala Lys Gly Phe Lys Gin Gin Tyr Gly 
15 10 15 

He Asp Tyr Asp Asp Thr Phe Ser Pro Val Val Lys His Ser Thr He 
20 25 30 

Arg Leu Val Leu Ser Leu Ala Val Ser Gin Lys Trp Ser Leu Arg Gin 
35 40 45 

Leu Asp Val Gin Asn Ala Phe Leu His Gly He Leu Glu Glu Thr Val 
50 55 60 

Tyr Met Lys Gin Pro Pro Gly 
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65 70 



<210> 67 
<211> 12 
<212> PRT 
<213> Zea mays 



<400> 67 

lie Tyr lie Leu Val Tyr Val Asp Asp lie lie lie 
15 10 



<210> 68 
<211> 71 
<212> PRT 

<213> Oryza sativa 
; ^;400> 68 

"Ijrg Tyr Lys Ala Arg Leu Val Ala Lys Gly Phe Lys Gin Arg Tyr Gly 
l 5 10 15 

JlJle Asp Tyr Glu Asp Thr Phe Ser Pro Val Val Lys Ala Ala Thr lie 
SI' 20 25 30 

.Arg lie He Leu Ser He Ala Val Ser Arg ; .y Trp Ser Leu Arg Gin 
m 3 5 4 0 45 

j^eu Asp Val Gin Asn Ala Phe Leu His Gly Phe Leu Glu Glu Glu Val 
m 50 55 60 



fgjyr Met Gin Gin Pro Pro Gly 
65 70 



<210> 69 
<211> 12 
<212> PRT 

<213> Oryza sativa 
<400> 69 

Met Phe Val Leu Val Tyr Vai Asp Asp He He Val 
15 10 



<2 1 0 > 70 
<2il> 71 
<212> PRT 
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<213> Arabidopsis thaliana 
<400> 70 

Arg Tyr Lys Ala Arg Leu Val Val Gin Gly Asn Lys Gin Val Glu Gly 
1 5 10 15 

Glu Asp Tyr Lys Glu Thr Phe Ala Pro Val Val Arg Met Thr Thr Val 
20 25 30 

Arg Thr Leu Leu Arg Asn Val Ala Ala Asn Gin Trp Glu Val Tyr Gin 
35 40 45 

Met Asp Val His Asn Ala Phe Leu His Gly Asp Leu Glu Glu Glu Val 
50 55 60 

Tyr Met Lys Leu Pro Pro Gly 



65 


70 


■©2io> 


71 


j©211> 


12 


Nhl2> 


PRT 


!<213> 


Arabidopsis thaliana 


4 o o > 


71 



'Leu Arg Val Leu lie Tyr Val Asp Asp Leu Leu lie 
1 5 10 

1^21 0> 72 

(5*211 > 71 
;&12> PRT 
<213> Drosophila melanogaster 

<400> 72 

Arg Tyr Lys Ala Arg Leu Val Ala Arg Gly Phe Thr Gin Lys Tyr Gin 
15 10 15 

lie Asp Tyr Glu Glu Thr Phe Ala Pro Val Ala Arg lie Ser Ser Phe 
20 25 30 

Arg Phe lie Leu Ser Leu Val lie Gin Tyr Asn Leu Lys Val His Gin 
35 40 45 

Met Asp Val Lys Thr Ala Phe Leu Asn Gly Thr Leu Lys Glu Glu lie 
50 55 60 

Tyr Met Arg Leu Pro Gin Gly 
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65 ' 70 



<2I0> 73 
<21I> 12 
<212> PRT 

<213> Drosophila melanogaster 
<400> 73 

lie Tyr Val Leu Leu Tyr Val Asp Asp Val Val lie 
15 10 



<210> 74 

<211> 62 

<212> PRT 

<213> Zea mays 

^4 0 0 > 74 

'%sp Ala Asp Trp Gly Ser Cys Leu Asp Asp Arg Arg Ser Thr Ser Gly 
^ 15 10 15 

jffyr Cys Val Phe Val Gly Gly Asn Leu Val Ser Trp Arg Ser Lys Lys 

SS 20 25 30 

J31n Ser Val Val Ser Arg Ser Thr Ala Glu Ala Glu Tyr Arg Ala Met 
i^l 3 5 4 0 4 5 

|^1 a Leu Ala lie Cys Glu Met Leu Trp lie Lys Gly Leu Leu 
m 50 55 60 

<210> 75 

<211> 17 

<212> PRT 

<213> Zea mays 

<400> 75 

Asn Pro Val Gin His Asp Arg Thr Lys His Val Glu lie Asp Arg Phe 
15 10 15 

Phe 



<210> 76 
<211> 62 
<212> PRT 
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<213> Zea mays 
<400> 76 

Asp Ala Asp Trp Ala Gly Cys Pro 
1 5 

Tyr Ala Leu Phe Leu Gly Pro Asn 
20 

Gin Ser Thr Val Ser Arg Ser Ser 

35 40 

Ala Asn Ala Thr Ala Glu Val He 
50 55 




Asp Asp Arg Lys Ser Thr Gly Gly 
10 15 

Leu He Ser Trp Asn Ser Lys Lys 
25 30 

Thr Glu Ala Glu Tyr Lys Ala Met 
45 

Trp Leu Gin Ser Leu Leu 
60 



<210> 77 

<211> 17 

?J§212> PRT 

*y.213> Zea mays 

N|400> 77 

jEys Pro He Phe Asn Ala Arg Thr Lys His He Glu Val Asp Phe His 
§ 1 5 10 15 

' Phe 



rfs2lQ> 78 

S« 2 1 1 > 6 2 

|H212> PRT 

"<213> Oryza sativa 

<400> 78 

Asp Ala Asp Trp Ala Gly Ser He Asp Asp Arg Lys Ser Thr Gly Gly 
15 10 15 

Phe Ala Val Phe Leu Gly Ser Asn Leu Val Ser Trp Ser Ala Arg Lys 
20 25 30 

Gin Pro Thr Val Ser Arg Ser Ser Thr Glu Ala Glu Tyr Lys Ala Val 
35 40 45 

Ala Asn Thr Thr Ala Glu Leu lie Trp Val Gin Thr Leu Leu 
50 55 60 
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<210> 79 
<211> 17 
<212> PRT 

<213> Oryza sativa 
<400> 79 

Asn Pro Val Phe His Ala Arg Thr Lys His lie Glu Val Asp Tyr His 
i r 10 15 



Phe 



<210> 80 
<211> 62 
<212> PRT 

<213> Arabidopsis thaliana 
^4 0 0 > 8 0 

^fisp Ser Asp Trp Gin Ser Cys Pro Leu Thr Arg Arg Ser lie Ser Ala 
]p 15 10 15 

Jtfyr Val Val Leu Leu Gly Gly Ser Pro He Ser Trp Lys Thr Lys Lys 
% 20 25 30 

;.Gln Asp Thr Val Ser His Ser Ser Ala Glu Ala Glu Tyr Arg Ala Met 
m 35 40 45 

i.Jer Tyr Ala Leu Lys Glu He Lys Trp Leu Arg Lys Leu Leu 
5C 5 5 6 0 



<210> 81 
<211> 17 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 81 

Asn Pro Val Phe His Glu Arg Thr Lys His He Glu Ser Asp Cys His 
15 10 15 

Ser 



<210> 82 
<21i> 63 
<212> PRT 
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<213> Drosophila melanogaster 
<400> 82 

Asp Ser Asp Trp Ala Gly Ser Glu 

1 5 

Tyr Leu Phe Lys Met Phe Asp Phe 
20 

Arg Gin Asn Ser Val Ala Ala Ser 
35 40 

Leu Phe Glu Ala Cys Arg Glu Ala 
50 55 



lie Asp Arg Lys Se'r Thr Thr Gly 
10 15 

Asn Leu lie Cys Trp Asn Thr Lys 
25 30 

Ser Thr Glu Ala Glu Tyr Met Ala 
45 

Leu Trp Leu Lys Phe Leu Leu 
60 



<210> 83 

<211> 17 

||12> PRT 

Mpl3> Drosophila melanogaster 

W400> 83 

'Jfcm Pro Ser Cys Kis Lys Arg Ala Lys His lie Asp He Lys Tyr His 
p : i 5 10 15 

¥he 



:^210> 84 
^|2il> 1181 
jf^212> DNA 
"<213> Zea mays 

<400> 84 

gggaattcga tiactcacta tagcgctcga gcggccgccc gggcaggttc gaaaaccatc 60 

aacctaacga tgtaatggag tatggtggtt cccccaagac agagttccag cgctacattc 120 

gacttggtcg taaagacatg ttggattggt cgtttggtga ggaggctggt ccagatgaag 180 

gcaagctctt ggatcaccag cccttacggc ttaacaatga tgattatgag cgggttaagc 240 

aaattcctgt caagaaggga gccaacttcc gtgacctaaa gggtgtcaag gttggagcaa 300 

ataatgttgt tgagcgggat ccagaagtcg aacgtgtgta cctttcgtct gggaaaccac 360 

tggttcctga ctatgcgatg tcattcatca agggcaaatc actcaagcca ttcgggcgcc 420 

tgtggtggga ccagacggtt cctacagttg tgaccagagc agagcctcat aaccaggtta 480 

tattgcatcc gactcaagca agagtcttga ctatccggga gaacgcaagg ttacagggct 540 

tccccgatta ctaccgattg tttggaccga tcaaggagaa gtatattcaa gtcgggaacg 600 

cagtggcagt ccctgttgca cgggcactgg gctactgtct gggtcaagcc tacctgggtg 660 

aatctgacgg gagtcagcct ctgtaccagc tgcctgcaag ttttacctct gtggggcgaa 720 

ccgcggttca ggcgaatgcc gcttctgttg gcactcctgc gggggaggta gtcgagcagt 780 
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aaaaggatag cggagcaacc ctggttggta ttttgattcg agcccatcca gtagcatgtt 840 
taccaataaa taatcattgg tcgtgctgat tcttatggtt ggagatgaat gtatgtaggg 900 
tgtactcgag ctcgagtgct tgttgtactg taggttgagg tttctcatcc attggcctgc 960 
ctatttgtgg atgacgtttc atttcagatt agcaatgtgc ttatttaagg tttcgtcatg 1020 
tacctgtatt ctacaatcca ctattgtttc caaagacagc atttgatcct taaaaaaaac 1080 
tgtaaaaaaa aaaaaacagt gcccgaaaag ccgcaaaaaa aaaaaaaaaa aaaacctgcc 1140 
cgggcggccg ctcgagccct atagtgagta atcgaattcc c 1181 



<210> 85 
<211> 6 
<212> PRT 
<213> Zea mays 

<400> 85 

Glu Phe Asp Tyr Ser Leu 
1 5 



&210> 86 

^f211> 252 

%212> PRT 

^213> Zea mays 

!J|400> 86 

"Arg Ser Ser Gly Arg Pro Gly Arg Phe Glu Asn His Gin Pro Asn Asp 
m 1 5 10 15 

j^al Met Glu Tyr Gly Gly Ser Pro Lys Thr Glu Phe Gin Arg Tyr lie 
if\ 2 0 2 5 3 0 

f|irg Leu Gly Arg Lys Asp Met Leu Asp Trp Ser Phe Gly Glu Glu Ala 
35 40 45 

Gly Pro Asp Glu Gly Lys Leu Leu Asp His Gin Pro Leu Arg Leu Asn 
50 55 60 

Asn Asp Asp Tyr Glu Arg Val Lys Gin lie Pro Val Lys Lys Gly Ala 
65 70 75 80 

Asn Phe Arg Asp Leu Lys Gly Val Lys Val Gly Ala Asn Asn Val Val 
85 90 95 

Glu Trp Asp Pro Glu Val Glu Arg Val Tyr Leu Ser Ser Gly Lys Pro 
100 105 110 

Leu Val Pro Asp Tyr Ala Met Ser Phe lie Lys Gly Lys Ser Leu Lys 
115 120 . 125 
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Pro Phe Gly Arg Leu Trp Trp Asp Gin Thr Val Pro Thr Val Val Thr 
130 135 140 

Arg Ala Glu Pro His Asn Gin Val He Leu His Pro Thr Gin Ala Arg 
145 150 155 160 

Val Leu Thr He Arg Glu Asn Ala Arg Leu Gin Gly Phe Pro Asp Tyr 
165 170 175 

Tyr Arg Leu Phe Gly Pro He Lys Glu Lys Tyr He Gin Val Gly Asn 
180 185 190 

Ala Val Ala Val Pro Val Ala Arg Ala Leu Gly Tyr Cys Leu Gly Gin 
195 200 205 

Ala Tyr Leu Gly Glu Ser Asp Gly Ser Gin Pro Leu Tyr Gin Leu Pro 
m 210 215 220 

# 

\l%la Ser Phe Thr Ser Val Gly Arg Thr Ala Val Gin Ala Asn Ala Ala 
M2 2 5 230 235 240 

j^er Val Gly Thr Pro Ala Gly Glu Val Val Glu Gin 
ffl 24 5 2 50 

1*5:2 10 > 8 7 

11 > 246 
&%212> PRT 
H^213> Zea mays 

m 

" S <4C0> 87 

Lys Val Gin Asn His Gin Pro Asn Asp Val Met Glu Tyr Gly Gly Ser 
5 10 15 

Pro Lys Thr Glu Phe Gin Arg Tyr He Arg Leu Ser Arg Lys Asp Met 
20 25 30 

Leu Asp Trp Ser Phe Gly Glu Gly Ala Gly Pro Asp Glu Gly Lys Leu 
35 40 45 

Leu Asp His Gin Pro Leu Arg Leu Asn Asn Asp Asp Tyr Glu Arg Val 
50 55 60 

Gin Gin He Pro Val Lys Lys Gly Ala Asn Phe Arg Asp Leu Lys Gly 
65 70 75 80 

Val Arg Val Gly Ala Asn Asn lie Val Glu Trp Asp Pro Glu He Glu 
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85 90 95 

Arg Val Lys Leu Ser Ser Gly Lys Pro Leu Val Pro Asp Tyr Ala Met 
100 105 110 

Ser Phe lie Lys Gly Lys Ser Leu Lys Pro Phe Gly Arg Leu Trp Trp 
115 120 125 

Asp Glu Thr Val Pro Thr Val Val Thr Arg Ala Glu Pro His Asn Gin 
130 135 140 

Val lie lie His Pro Thr Gin Ala Arg Val Leu Thr lie Arg Glu Asn 
145 150 155 160 

Ala Arg Leu Gin Gly Phe Pro Asp Tyr Tyr Arg Leu Phe Gly Pro lie 
165 170 175 

j^ys Glu Lys Tyr lie Gin Val Gly Asn Ala Val Ala Val Pro Val Ala 

'X 180 185 190 

*! 

rj|rg Ala Leu Gly Tyr Cys Leu Gly Gin Ala Tyr Leu Gly Glu Ser Glu 
"f« 195 200 2 05 

f§ly Ser Asp Pro Leu Tyr Gin Leu Pro Pro Ser Phe Thr Ser Val Gly 
210 215 220 

^|ly Arg Thr Ala Gly Gin Ala Arg Ala Ser Pro Val Gly Thr Pro Ala 
j%£2 5 230 235 240 



3&ly Glu Val Val Glu Gin 



24 5 



<210> 88 
<211> 226 
<212> PPT 
<213> Zea mavs 



<400> 88 

Asn His Gin Pro Asn Asp Val Met Glu Tyr Gly Gly Ser Pro Lys Thr 
15 10 15 

Glu Phe Gin Arg Tyr lie Arg Leu Arg Lys Asp Met Leu Asp Trp Ser 
20 25 30 

Phe Gly Glu Ala Gly Pro Asp Glu Gly Lys Leu Leu Asp His Gin Pro 
35 40 45 
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Leu Arg Leu Asn Asn Asp Asp Tyr Glu Arg Val Gin lie Pro Val Lys 
50 55 60 

Lys Gly Ala Asn Phe Arg Asp Leu Lys Gly Val Val Gly Ala Asn Asn 
65 70 75 80 

Val Glu Trp Asp Pro Glu Glu Arg Val Leu Ser Ser Gly Lys Pro Leu 
85 90 95 

Val Pro Asp Tyr Ala Met Ser Phe lie Lys Gly Lys Ser Leu Lys Pro 
100 105 110 

Phe Gly Arg Leu Trp Trp Asp Thr Val Pro Thr Val Val Thr Arg Ala 
115 120 125 

Glu Pro Kis Asn Gin Val lie His Pro Thr Gin Ala Arg Val Leu Thr 
130 135 140 

=|lle Arg Glu Asn Ala Arg Leu Gin Gly Phe Pro Asp Tyr Tyr Arg Leu 
?J$4 5 150 155 160 

.-||he Gly Pro lie Lys Glu Lys Tyr lie Gin Val Gly Asn Ala Val Ala 
Q 165 170 175 

Mfal Pro Val Ala Arg Ala Leu Gly Tyr Cys Leu Gly Gin Ala Tyr Leu 
180 185 190 

f^fely Glu Ser Gly Ser Pro Leu Tyr Gin Leu Pro Ser Phe Thr Ser Val 
;^ 195 200 205 

;;^ly Arg Thr Ala Gin Ala Ala Val Gly Thr Pro Ala Gly Glu Val Val 
'■^ 210 215 220 

Glu Gin 
225 



<210> 89 
<211> 245 
<212> PRT 
<213> Zea mays 

<400> 89 

Arg Phe Glu Asn His Gin Pro Asn Asp Val Met Glu Tyr Gly Gly Ser 
15 10 15 

Pro Lys Thr Glu Phe Gin Arg Tyr lie Arg Leu Gly Arg Lys Asp Met 
20 25 30 
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Leu Asp Trp Ser Phe Gly Glu Glu Ala Gly Pro Asp Glu Gly Lys Leu 
35 40 45 

Leu Asp His Gin Pro Leu Arg Leu Asn Asn Asp Asp Tyr Glu Arg Val 
50 55 60 

Lys Gin He Pro Val Lys Lys Gly Ala Asn Phe Arg Asp Leu Lys Gly 
65 70 75 80 

Val Lys Val Gly Ala Asn Asn Val Val Glu Trp Asp Pro Glu Val Glu 
85 90 95 

Arg Val Tyr Leu Ser Ser Gly Lys Pro Leu Val Pro Asp Tyr Ala Met 
100 105 110 

Ser Phe He Lys Gly Lys Ser Leu Lys Pro Phe Gly Arg Leu Trp Trp 
115 120 125 

HSsp Gin Thr Val Pro Thr Val Val Thr Arg Ala Glu Pro His Asn Gin 

h Q 130 135 140 

:ff?al He Leu His Pro Thr Gin Ala Arg Val Leu Thr He Arg Glu Asn 

Hj 45 150 155 160 

1'Ala Arg Leu Gin Gly Phe Pro Asp Tyr Tyr Arg Leu Phe Gly Pro He 
j^. 165 - 170 175 

U$>YS Glu Lys Tyr He Gin Val Gly Asn Ala Val Ala Val Pro Val Ala 
m 180 185 190 

j\4rg Ala Leu Gly Tyr Cys Leu Gly Gin Ala Tyr Leu Gly Glu Ser Asp 
195 200 205 

Gly Ser Gin Pro Leu Tyr Gin Leu Pro Ala Ser Phe Thr Ser Val Gly 
210 215 220 

Arg Thr Ala Val Gin Ala Asn Ala Ala Ser Val Gly Thr Pro Ala Gly 
225 230 235 240 

Glu Val Val Glu Gin 
245 



<21Q> 90 
<211> 12812 
<212> DNA 
<213> Zea mays 
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<400> 90 

tacatcaata aaataagggg cgccaacgca attgtccctt gttttttcta acttaaagtt 60 
caagcggcaa tgtcgcatct gatgtatgaa tatcaatttg taagtactaa acatagtctt 120 
aattccaaaa ttaattacac aa-caaagact aaattgtaag caaacctttc aagtctaatt 180 
aattcataat tacaaatgtt attgtaacat catgttaccg aatcataaac taaccaggtt 240 
cccatgtgta attagtttta taattatatt atatttaata tttgtaacta attgatgtga 300 
cagtactaaa attaagcctc ttaagccaaa aaatccacat attttagatt taaaatttga 360 
aaacagacgt atcggctaga agagccctgt cactgtcagc taatcaatta caagaagtgg 420 
cccatactag ttccatcacc agtccagtag tccaccaccc caccctacag ctgggtcatc 480 
tggcacgggt ggaggggcca acggccaaaa gcgccgcgca cttccggcgg gcaccctcgc 540 
ggagtcgcgg gtgacagcga aatttcaaat ccataccctc ccgctgcaga cgggccccac 600 
gccgtcaaaa tttggacgct cccgctccct cgatcttttg ggtttcgttt tcccagttcc 660 
caccctctct tccaccctgc cctgtttcca gatttgaccg atccccttcg attcgatttc 720 
tacacccacg gtgtccagac tccagagcac tcactctcct ggcaaacccc tttcgtcttc 780 
ccaaccctag agagcagcag cagctaccgc agcccctgcc atggcgccga gctccccgtc 840 
acccgccgcg cctacacgcg tctctgggcg gaagcgcgcc gccaaggccg aggagatcca 900 
ccagaacaag gaggaggagg aggaggtcgc ggcggcgtcc tccgccaagc gcagccgcaa 96 0 
|}||cggcatct tccgggaaga agcccaagtc gccccccaag caggccaagc cggggaggaa 102 0 
^agaagggg gatgccgaga tgaaggagcc cgtggaggac gacgtgtgcg ccgaggagcc 108 0 
=4feacgaggag gagttggcca tgggcgagga ggaggccgag gagcaggcca tgcaggagga 114 0 
s^tggttgcg gtcgcggcgg ggtcacccgg gaagaagagg gtggggagaa ggaacgccgc 12 0 0 
4gccgccgct ggcgaccacg agccggagtt catcggcagc cctgttgccg cggacgaggc 1260 
'tpbgcagcaac tggcccaaag cgctacggcc gcagcacttg ccgcaaaaga agtacattat 1320 
%ttctcccag ctctggtttt gatttgacca gatttttact ccatgtctgt tagtacttgc 1380 
'§agctgagca atctgctatt tgctgattta ttgtgcgtgc agaccggatg aggaggaaga 144 0 
■QCtcaaggcc agatgtcact accggagcgc taaggtggac aacgtcgtct actgcctcgg 1500 
;§gatgacgtc tatgtcaagg tccttgttca tcgctttctg ttgcttctgc tctcatttat 1560 
"Satgtgcata tgtgtttgtt aaggaagcaa gaattgcttg atttttgttg ccgactcgca 1620 
l^ttccgtgac gagttctgcg tatggtcacc ggtacgtggc actgatacac aacgtggtat 1680 
l^ctggaagtc tggtagtata ttttgcatcg accaggaggt ccagatcgat atgtgcggta 1740 
-SSagtgcttat ttgattgcac cctgttcgga gattcatgca tgatggcgtg tttagatgac 1800 
"gcctcccaga cagctgcctg ccaggcagct gattctggcc caggcgtccg gaatggtgaa 1860 
gttgcgctgg caagattctc aggccaccta ccaaatatgc cctggagcat attgcatgct 1920 
tctttttttg ttctctttcc ttctatattt atctcattgt tagtgaagtt tcacattgca 1980 
cgtgtcatgg aatatttact ttcaaatcaa cgaggagatg ctagcattga ggtgtgtgat 2040 
aattattaca tactagaaga tatcgtgcat gttgccattg ggattgcgaa gaatgtggaa 2100 
atgatgttgg tagcttgtat taagaggtta acagttagtg ggatgacatg actattagtt 2160 
agagatgatg tggatagtaa gtgggatatg atgtagatga cttgtgtgtt gagacagaac 222 0 
tataacatgg agttggaaat gggagcagca tggtcaaaca taccctaaat gcctgtctct 2280 
acacaatgtg gtgattggtg tatagtctgg tgttaaaagc tggatacctt gattcxgttg 2340 
aagattgtca cacccgaatt taaggacaaa tctagataca tctcatatgt gcaccaggat 2400 
agtgtataga taccaatgtc ataatcttta ttacacgacg ataatgtctt acaaaatatc 2460 
tggtgttaca agatgcacct ttcaacatgt tta.-.rgctgc aaactgtttt aattaaacag 252C 
aatgcagtgt tttgaacaaa aaaatgctgc tttatcctgc atcttgtttt gcatgtgtca 2580 
gcaattgctt acattccatt atgatctctg agattcttta aauttctagc acgatgaaag 2640 
tatttactaa ttcaactgaa cacaaacatt gtttgaatga acaaggcaac' acggatgctt 2700 
ggaataatgg ttgtgtacaa tatcacttag tggttttgct ctcacaccac atctttcatg 2760 
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ggttctttaa taatagttac tgactttaag tttcttattc ctttttgtct atcttaggct 2820 
ggagaaaacg aggcagatta cattggccgc attactgaat tttttgaggg gactgaccag 2880 
tgtcactatt ttacttgccg ttggttcttc cgagcagagg acacggtgtg tatttagtat 2940 
tttgtcattc tatgcatgtg tggatttttc tggaatgtgg aaaacataca gcactctctc 3000 
tacaccacac acacttctag tatatgtgta cacgttaatg ggccaacact agacacatgg 3060 
cccaacactc cccctcaaga tgggcgatag atatcaatca tccccatctt gctacataac 3120 
acatcacact cttttactcc tataccctta gtcaagcaat ctgctatttg accttttgag 3180 
tttacatgat tcaactctaa agtaccatta tctaacttct ctttgatgaa gaatcgatca 3240 
atzttccacat gttttgttct atcatgttga actggattgt tagctatatt catggctgac 3300 
ttattatcac accataactt cagggagtct tttcttaata cattcaactc tgataagaga 3360 
ccctttatcc atagcatctc acatatcgca agggccatag ctcggtattc tgcttcggcg 3420 
gtggaacggg ataccacaga ttgtttcttg cttctccatg atactaaatt tcctccaaca 3480 
aacacacaat atcctgaagt tgaccttcta tcatcaaggc aactacccca gtctgcatca 3540 
gagtaacctt ccacctttag atgaccatga cctttaaaga ttattccctt tccaggacaa 3600 
gtcttcaagt atcgcagtat acgatacact gcatcaagat gtccacttct ggggtcatgc 3660 
atatatcgac tcaccacact gactgcatat gtgatatcag gtcttgtatg gcacaagtag 3720 
atgagccgtc caacaagtct ttgatacctt tctttattca caggatcacc agattcagca 3780 
*wjataatttat gattcaagtc gataggtgtt gctacaggcc gacaccccaa catacctgtt 3840 
3f catcaagta gatctaaaac atatttcctt tgggagagaa ctattccttt tggagatcga 3900 
Ijjcaatctcaa taccaagaaa gtatttgaga tgaccaagat ctttaacctc aaattcctta 3960 
ypttagattct tctttagaca tgcaatctca agatcggcat cacctgtaat aataatatca 4020 
J&ccacataca cagctagaat tgcaattcgt cgtccaaagt gttgataaaa aacagtgtga 4080 
CJctccgttgc attgtttata tcccatgcta catattgcac gtctaaatct gtcaaaccat 4140 
G^ctcttgggg actgcttgag accatacaat gattttttca atcgacaaac tttcccaatt 4200 
Mgtctcaggct ttgacaatcc aggagggatc tccatataga cctcctcttg caaatcacca 4260 
u tgtaagaaag cattcttaac atctagttga tacaagggcc atccaaaatt tgcagcacaa 4320 
Qgagatcaatg tccntacagt actcattttt gccactggtg caaatgtctc atcataatca 4380 
Mattccatatg tttgactara ccctcttgca accaatcttg ctttatatcg ttctaccctt 4440 
Nbcttctgggt tttgcttcac agtgaatacc catttacaac taactgcctt ctttccttta 4500 
Olggtagtttct caaattccca agtttgattt ttttctagag ctttaagctc ctccaacatt 4560 
^gcctcacgcc agttagaatt acattgtgct tctttccaat ctcttggaat tgctacggaa 4620 
Httgcaatgatg caacaaatgc tctatatgat ggtgacaaag acgcatatga gacataattg 4680 
ctaatgtcac gttcatatcc ataccttgtt ggggggactc cagctttagc acgcgctcct 4740 
tttcgtattg caatgggcaa atcataagtg tcataatctt cagtttctcc atgagacgtc 4800 
aaaggtacat ttatagcctc taatgtgttt ggagagaact gctcagtact tgatgctgaa 4860 
ttggtttcag gagcctgagg ttgcacatgg gactttcttc ttgtatatac ttcgccctta 4920 
tatcgtaagt cgcctccaca agatttatta ttctcgtgac taggatgtgt ctccaattca 4980 
cttggcatta cttgcatctt ttgagaagca ccaatcacca cttccatttt atttggttgt 5040 
tgttccattg aatcaaccat tctgttctcc ccctctcgac tagcttcatc tgtgctagta 5100 
gagacagaat caagaaaaaa atttagatcc gtcttctcac catagaaagg cacagtctct 5160 
ctaaatgtaa catccatgct tacaaacaaa cgtcgttcac taggactcca acacttgtat 5220 
cccttttgcc ctgcaggata tccaacaaaa atgcacttca cagcacgagg atccaacttc 5280 
cccacctgag gtctatgatc tctgacaaaa catgtacatc caaaaatttt aggtggaacc 5340 
acaaacttat tctcaccgag aagaatctca catggagtct tcattgcaag tatttttgaa 5400 
ggagtgcgat taataagata tgtggcagtc aatacagctt cactccatag gaacttcgga 5460 
acattcattg taaacatcag cgaacgagca acttccaaaa tgtgacgatt cttcctttca 5520 
gccacaccat tttgtggagg tgtatcagga caggatgtct gatgtaatat accatttctt 558C 
gacagaaatg cattaaatcc cttgtttaca tactcggttc cattgtctgg tcttaggatt 5640 



51 



• 



ttgacttgag tattgaattg attctcaact 
acttcatctt tatgcttcat cacatagacc 
gtaacaaagt acttcatccc attaatagaa 
actagcacaa aaggagatat actcctgata 
tttgcaaact cacaggcatc acacaatagc 
ggaaaagctt tgcatatctt atcaaaagaa 
actgcaacct ccttctcttc cattcttgtt 
ccctcatgat ccatatacca caatccatta 
gtttccctct cctgaattaa acaattatct 
tcaaccaagg cacttagtga tatcaagttg 
gacaacttaa tagatggagt gcattgcact 
ccatcagcag tttgtataat ttctttacgt 
tcactggacg tgcctgtgac atgctttgat 
gtgacctgtg tgggtacaaa agcatgagca 
acaaaatccc ctgtgtgaga ctcctgatct 
aactttgttt catcttcgtg ttccataaat 
ttcgcccttg cccaactcct gcctccacga 
j^tcccacgat taactttgga aggcttagaa 
i|_f aattgtaac attctctagt atctttggtt 
i|gftttgagaac tttctctcat cactttgagt 
^gcttcttgta ggctaggaag agtggattga 
s |f ctgagttta gcccccttag caattgaagt 
jjpaagcaacac actctgagtg tggtagctca 
;JJaaacattgta actcctgaac gtactccgcc 
Maggcgatctt cagtctccac catcaacata 
u agtgcc ttcc acatttctgc agcacttatg 
^tcatagaac tcaacatcca cgctgccact 
l^tcatcactita acntatcctt gggctcaacg 
^tttgccttca ataatcgcaa tgctcttcta 
Mjaacttaatct catttggcat taggtctatc 
!jftctttaartg atgatggagc ctcatccctt 
^fegaacctgct caattccttg attttccccc 
cacttgggca gctgcgtcaa gatctgggtc 
ctcctcttcc tcctcctccg agctggatgg 
ggagcagcag cacctgtgtg ccggcagctt 
gagagcctcc caagcacccc tatctccaga 
gtccttggcc gccctcgcct tgtcggcggt 
tccctggcct cctgcgcggc ctccctgctg 
gtcacgtccc tcgcctccct cgatcgctcg 
tgatctcctt ctcggtggtc ttctccgtcg 
tcgccgttga gcctggctct gataccatgt 
cactctctct acaccacaca cacttctagt 
gacacatggc ccaacagcat gtcaagtggc 
caattcrntg gtgtccataa gtgtggatgg 
ttctgaggaa aagaacgaca atgtgcttga 
tgttgatcca aatgtaagtt tgctgcagtt 
ataatgtttc tgactaccat tgttttgttg 
aggctcagct gatagagagt tgcgacctat 



agtgcacaaa aattttgaaa acacttcaat 5700 
caagtcattc cgagaaaaac aatcgataaa 5760 
gtcacaggac atgtccaaac atcagaatga 5820 
cctcgactaa tataagatgt ccttgtgtgt 5880 
ttgcttttat ccaccccact cattacatca 5940 
agatgcccta atctacaatg caagagcatc 6000 
gccagcatag tgcatattgt accattagtc 6060 
cgcctggtag ctgtcccaag tctcttccct 6120 
cgatcaagaa taatacgaca atccaattga 6180 
acaggaaagg ttggcacata caaaactgat 6240 
gtgccaacac ccttgatggg ttgtggtgta 6300 
gtggggggat atcttatata tgatgtaaat 6360 
gctcctgagt ctaaaatcca ttttaactgt 6420 
taattacctt catcagtgta ggcgaagtgg 6480 
ttatctccag agatttgatt tttcttcctc 6540 
gtttcaagtt cttcttgtgt agttgctgca 6600 
cctctgccgc ctctaggagc ccctcttcct 6660 
caattacgtg caatatgtcc aacattacca 6720 
ctcatagctg aaaacacagg atgaggcggc 6780 
cttgactcct cctgggatat ggcagctatg 6840 
tgaaacatgg aggcacgtct tccctcgaac 6900 
acacgtcttt tttccaccca tttcttcgcc 6960 
ataggatcat aatgatcaac atcagcccat 7020 
acagatcgct ccccctgttt gatattatgg 7080 
acatttccag ctcccgagta catttcttca 7140 
attgtatcaa cagtgctagc aattgctgga 7200 
aaagagttta tagcatccca gtctttccat 7260 
acatctcctt taacatagcc ctcgagtctc 7320 
gaccatgcca aataattttt caccccttct 7380 
ttctgaactg gttctatatg agcaacattg 7440 
tctgctgaca gtaattcgac caatttacca 7500 
ataatatata tgtaataata actaggaaac 7560 
acaacgtcta gaagccagga ccaggagcgc 762 0 
atctcagtca caggacgcgg gcagcagggg 7680 
ccccaagggt tggacgagct gcggcagctg 7740 
tccttgtcgc cgacgagtgc ccgcgtccac 7800 
ggcgtcctct gtgctgtggc tcgggacctg 7860 
gcgtcggtgt actcgcccgt cttctgcctg 7920 
tgtgcctcgg cggcctcctt cggccgtcgc 7980 
aggccgaaga cactcgtcac cgcgacgcca 8040 
ggatttttct ggaatgtgga aaacatacag 8100 
atatgtgtac acgttaatgg gccaacacta 8160 
atagcactca catttgtcat ggcaggttat 8220 
ccacaagcat gaccctagac gtgtttttct 8280 
ttgcattatc tccaaggtca agatagtcca 8340 
tgctgagagc tttgtggttt tgctatacac 8400 
cctacttgcc rtagatggat ccaaaagcca 8460 
actatgacat gtcttactcc gttgcatatt 8520 
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ctacatttgc taatatctcg tctggtaatt 
ctgaatgcag ttagctttgc caaagagtta 
ggatgtcaat aatgtccaca aactctgaaa 
cagtaaaacc tgtcagatac ttgggttttg 
ctcgtcaagg gatgtcaatg acgtgttgat 
ctaatctaca caaacagctt atgtaatgta 
cttaggttta taggattagc gtttagtgca 
gccgagctcc tggctagaca gccatcctgg 
gccacgtttg cactagcagg ttttcctggg 
tataactgaa ttggtgctca cctttgtcta 
atgcaactcc atactcaata gtcaatatga 
tgaatcctgg caggggcaat caaataaaat 
agcacatgag ggagagtgtt gaattataag 
ctggtttgtg catgtaactc aatatgatat 
ttgatgcaat ttaataacat atttttttgg 
attgcacaca tttccttata gcttaggttt 
ataactaaag ttgatctgga cagtctacag 
^tattttta caaacgctgg cacttttttc 
'Ifbcgcttcgg gtatttcttc tgatgatgtg 
,|^gacagcaa cccttcttga tctgtattct 
jjjtgggtgcag ctctttctgg cttgaaactt 
'^ggatagaai atgttcacga tctcagaact 
;&tgttgattt caacagtttt gcgtgccaaa 
^atggatagt aaacttcatc ttggattcca 
ij^tggattttt ggatgtaggt gcgaaatgag 
^gaatgggcag ttctatgcaa aaaatatgtc 
fgjaggatcaag cggatgaaga cagccctctt 
|#tcgggatat gttatggtgg cagtgacagg 
Hgtcatttgt tcatttctac ttgattccaa 
^pgatggatar ttctgtatat tttgctgtaa 
^tacggccct gaggaggata catgggaacc 
s^tcgtctgct tgttgccttg tatacctatt 
actgcccgca gaaaattaga gaatttgtac 
tgcctgtgag tatttagttc gttgtgattt 
atttggtgat atctgcctat tttattcttt 
gcccaccatg ccaaggtatc agtgggttta 
aagatgagaa aaacaaacaa atggtgactt 
agtatgttct catggaaaat gtggtggaca 
aatatgcttt gagcugcctt gttgctatga 
ctggttgcta tggtctgcca cagttcagga 
ccatggtctg ttctgtacct tgctgtttta 
ggctgctggc taaccaggtg tacgtgtatt 
tgcccaacta tgatgttgta gtacgtggag 
cacaaaccac tactatgaaa tcatgtggaa 
ttgcagcaat gtatggttgc atatgacgag 
cttcttggcg atgcaatttc agatttacca 
tctcagtgag catgctattt aactcttctc 
tgatggagta tggtggttcc ccaagaccga 



ccttctgcat catctttttt ggttgactag 8580 
aatacatgag ttgttcctgc actcgaaaag 8640 
atgtattttt agatacttaa cttgttaagt 8700 
ggtacgatta ccatccttat gtgagtaaaa 8760 
tgtgtattag atattctgtt tgttcgaaag 8820 
aaacctcaaa caaacttgcc tcttcataag 8880 
tgtaaggcct atttgcttca cggcctccct 8940 
ccgtaggtgc ccgaaatcga acacctggga 9000 
tgcaaaccaa acacgcctat agtgttcaag 9060 
atagcttaag tttttggttt tcatcggtgc 9120 
tatagtgttc aagcatagaa ctctcgagtt 9180 
aattgcagct tacccctatt tctacgtttg 9240 
tgtgttctcc atctttctct aacagatgaa 9300 
ttgagtcaaa tgtttacttt aaaatcatag 9360 
tctcgtgtga gggagtgtac gtataactga 9420 
ttgactgcaa ctgttggtgc atgtagctca 9480 
tgaataagtt tgacacttgt aaaatgtgca 9540 
ctaatagaaa atgggcagtc aggcagtgat 9600 
gatctggaga cgtcatctag tatgccaacg 9660 
ggctgtgggg gcatgtctac tggtctttgc 9720 
gaaactgtaa tcttctaact agtcatctgt 9780 
tattctattg ttctggcttg cagcgatggg 9840 
gtttaaaata taatcatcca cagactgagg 9900 
tctgttctgt cagctactct tacaaagtgt 9960 
aaagccgatg agtttcttgc cctccttaag 10020 
caagatgtgg attcaaattt agcaagctca 10080 
gacaaggacg aatttgttgt agagaagctt 10140 
gaaaatggca tctattttaa ggtacttcag 10200 
caaaaaaatc aattacttaa gcctgtcaaa 10260 
cgctagattt ctgcaggtcc agtgggaagg 10320 
gattgataac ttgaggttag tgtatggtat 10380 
tgcatctaat ccttgttttg caaaccagtg 10440 
aagaagggca caaaagaaag attctcccac 10500 
tgctcgctat ttgtttagct cccctttttt 10560 
caaagggtga tgttgatgtc atttgtggag 10620 
atcggtacag aaaccgtgat gagccactca 10680 
tcatggatat tgtggcgtac ttgaagccca 10740 
tactcaaatt tgcggatggt tacctaggaa 10800 
agtaccaagc gcggcttgga atgatggtgg 10860 
tgcgtgtgta cctctggggt gctctttctt 10920 
tatgcttcgc tagattcata ttgcactgtt 10980 
tgacaattta ggtgctccct aagtatcctc 11040 
gagcccctaa tgcctttcgg tgagtgcaat 11100 
tgtgtaaaat acgctgacca actgaatttg 11160 
acacaaaaac catccctgaa aaaagccttg 11220 
aaggcaagtg ttctgtcaag ttcatgcatt 11280 
tgcaggttca aaatcaccag cctaacgatg 11340 
attccagcgc tacattcgac tcagtcgtaa 11400 
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aggtaaaaaa ccccgtgaac tactactggt tggccttcac tacgaatatg ttaggattta 11460 

atttcagaag aaccgccttt tttttcttgg tgcttcggta ctactgcagc aagctcactc 11520 

ttattatcat gtcagacatg ttggattggt ccttcggtga agggggctgg tccagatgaa 11580 

ggcaagctct tggatcacca gcctttacgg cttaacaacg atgattatga gcgggttcaa 1164 0 

cagattcctg tcaagaaggt tggtggcttg gtcgcatttg tgccttcctt tgttgttttt 11700 

tccccttctg aaacaatcat ctctctttcc tatgacaggg agccaacttc cgcgacctaa 11760 

agggcgtgag 99ttggagca aacaatattg ttgagtggga tccagaaatc gagcgtgtga 11820 

aactttcatc tgggaaacca ctggtatgtg tgctatttcc gtgctgttgt ttcctataac 11880 

tgtgcaacat ttactttccc atattcaaac tcataactga cgagatgctg caactactgt 11940 

aagattcatg gctaacccat gacaacattt tgcacacatc tttgttatct aggttcctga 12000 

ctatgcaatg tcattcatca agggcaaatc actcaagtaa gtttcaaaac attttttttg 12060 

ttttttgggg gaaaagtagg ttattgttta cttgtgctta catatgatgt tgcaggccgt 12120 

ttgggcgcct gtggtgggac aagacagttc ctacagttgt aaccagagca gagcctcaca 12180 

accaggtcag cttcagaaag gccactcctt ttcgccaatc cctgcatctg tatttactat 12240 

tagcgtgtgt tcccatatga tcattaccga acatgttgtc cacacaggtt ataattcatc 12300 

cgactcaagc aagggtcctc actatccggg agaacgcaag gttacagggc ttccccgatt 12360 

attaccgatt gtttggcccg atcaaggaga agtaagttcc tgttttcaag ttgcctgtac 12420 

cagatctagt cactattgaa agttttcagc agcaagccat tcatcagtta gttacagctc 12480 

Hrtgaaagcct tacctctgaa catgtgtgct ttctctgatg gtgataggta cattcaagtc 1254 0 

;||iggaacgcag tggctgtccc tgttgcccgg gcactgggct actgtctggg gcaagcctac 12600 

^tgggtgaat ctgaggggag tgaccctctg taccagctgc ctccaagttt cacctctgtt 12660 

fsfgaggacgca ctgcggggca ggcgagggcc tcttcctgtt ggcacccctg caggggaggt 12 72 0 

^gttgagcag taaaaggatg acagatctga gctgagctgg gcaacatcca gcggcaggag 12 78 0 

l^ktttctggt tcggttcgat tcgggctcac ga 12812 
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